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A theory is all the more impressive the simpler its 
premises, the greater the variety of phenomena it 
describes, and the broader its area of application. This is 
the reason for the profound impression made on me by 
classical thermodynamics. It is the only general physical 
theory of which I am convinced that, within its regime 
of applicability, it will never be overturned (this is for 
the special attention of the skeptics in principle). 


Albert Einstein 


To my daughter Birgitta 


Preface to the Second Edition 


In this new edition, supplements, additional explanations and cross references 
have been added in numerous places, including additional problems and re- 
vised formulations of the problems. Figures have been redrawn and the layout 
improved. In all these additions I have pursued the goal of not changing the 
compact character of the book. I wish to thank Prof. W. Brewer for inte- 
grating these changes into his competent translation of the first edition. I am 
grateful to all the colleagues and students who have made suggestions to 
improve the book as well as to the publisher, Dr. Thorsten Schneider and 
Mrs. J. Lenz for their excellent cooperation. 


Munich, December 2005 F. Schwabl 


Preface to the First Edition 


This book deals with statistical mechanics. Its goal is to give a deductive 
presentation of the statistical mechanics of equilibrium systems based on a 
single hypothesis — the form of the microcanonical density matrix — as well 
as to treat the most important aspects of non-equilibrium phenomena. Be- 
yond the fundamentals, the attempt is made here to demonstrate the breadth 
and variety of the applications of statistical mechanics. Modern areas such 
as renormalization group theory, percolation, stochastic equations of motion 
and their applications in critical dynamics are treated. A compact presenta- 
tion was preferred wherever possible; it however requires no additional aids 
except for a knowledge of quantum mechanics. The material is made as un- 
derstandable as possible by the inclusion of all the mathematical steps and 
a complete and detailed presentation of all intermediate calculations. At the 
end of each chapter, a series of problems is provided. Subsections which can 
be skipped over in a first reading are marked with an asterisk; subsidiary 
calculations and remarks which are not essential for comprehension of the 
material are shown in small print. Where it seems helpful, literature cita- 
tions are given; these are by no means complete, but should be seen as an 
incentive to further reading. A list of relevant textbooks is given at the end 
of each of the more advanced chapters. 

In the first chapter, the fundamental concepts of probability theory and 
the properties of distribution functions and density matrices are presented. In 
Chapter 2, the microcanonical ensemble and, building upon it, basic quan- 
tities such as entropy, pressure and temperature are introduced. Following 
this, the density matrices for the canonical and the grand canonical ensemble 
are derived. The third chapter is devoted to thermodynamics. Here, the usual 
material (thermodynamic potentials, the laws of thermodynamics, cyclic pro- 
cesses, etc.) are treated, with special attention given to the theory of phase 
transitions, to mixtures and to border areas related to physical chemistry. 
Chapter 4 deals with the statistical mechanics of ideal quantum systems, in- 
cluding the Bose-Einstein condensation, the radiation field, and superfluids. 
In Chapter 5, real gases and liquids are treated (internal degrees of free- 
dom, the van der Waals equation, mixtures). Chapter 6 is devoted to the 
subject of magnetism, including magnetic phase transitions. Furthermore, 
related phenomena such as the elasticity of rubber are presented. Chapter 7 


xX Preface 


deals with the theory of phase transitions and critical phenomena; following 
a general overview, the fundamentals of renormalization group theory are 
given. In addition, the Ginzburg-Landau theory is introduced, and percola- 
tion is discussed (as a topic related to critical phenomena). The remaining 
three chapters deal with non-equilibrium processes: Brownian motion, the 
Langevin and Fokker—Planck equations and their applications as well as the 
theory of the Boltzmann equation and from it, the H-Theorem and hydrody- 
namic equations. In the final chapter, dealing with the topic of irreversiblility, 
fundamental considerations of how it occurs and of the transition to equilib- 
rium are developed. In appendices, among other topics the Third Law and a 
derivation of the classical distribution function starting from quantum statis- 
tics are presented, along with the microscopic derivation of the hydrodynamic 
equations. 

The book is recommended for students of physics and related areas from 
the 5th or 6th semester on. Parts of it may also be of use to teachers. It is 
suggested that students at first skip over the sections marked with asterisks or 
shown in small print, and thereby concentrate their attention on the essential 
core material. 

This book evolved out of lecture courses given numerous times by the au- 
thor at the Johannes Kepler Universitat in Linz (Austria) and at the Technis- 
che Universitat in Munich (Germany). Many coworkers have contributed to 
the production and correction of the manuscript: I. Wefers, E. Jorg-Miiller, 
M. Hummel, A. Vilfan, J. Wilhelm, K. Schenk, S. Clar, P. Maier, B. Kauf- 
mann, M. Bulenda, H. Schinz, and A. Wonhas. W. Gasser read the whole 
manuscript several times and made suggestions for corrections. Advice and 
suggestions from my former coworkers E. Frey and U. C. Tauber were likewise 
quite valuable. I wish to thank Prof. W. D. Brewer for his faithful translation 
of the text. I would like to express my sincere gratitude to all of them, along 
with those of my other associates who offered valuable assistance, as well as 
to Dr. H. J. Kélsch, representing the Springer-Verlag. 


Munich, October 2002 F. Schwabl 
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1. Basic Principles 


1.1 Introduction 


Statistical mechanics deals with the physical properties of systems which con- 
sist of a large number of particles, i.e. many-body systems, and it is based 
on the microscopic laws of nature. Examples of such many-body systems 
are gases, liquids, solids in their various forms (crystalline, amorphous), liq- 
uid crystals, biological systems, stellar matter, the radiation field, etc. Among 
their physical properties which are of interest are equilibrium properties (spe- 
cific heat, thermal expansion, modulus of elasticity, magnetic susceptibility, 
etc.) and transport properties (thermal conductivity, electrical conductivity, 
etc.). 

Long before it was provided with a solid basis by statistical mechanics, 
thermodynamics had been developed; it yields general relations between the 
macroscopic parameters of a system. The First Law of Thermodynamics was 
formulated by Robert Mayer in 1842. It states that the energy content of a 
body consists of the sum of the work performed on it and the heat which is 
put into it: 


dE = 6Q+6W. (1.1.1) 


The fact that heat is a form of energy, or more precisely, that energy can be 
transferred to a body in the form of heat, was tested experimentally by Joule 
in the years 1843-1849 (experiments with friction). 

The Second Law was formulated by Clausius and by Lord Kelvin (W. 
Thomson!) in 1850. It is based on the fact that a particular state of a ther- 
modynamic system can be reached through different ways of dividing up the 
energy transferred to it into work and heat, i.e. heat is not a “state variable” 
(a state variable is a physical quantity which is determined by the state of the 
system; this concept will be given a mathematically precise definition later). 
The essential new information in the Second Law was that there exists a 
state variable S, the entropy, which for reversible changes is related to the 
quantity of heat transferred by the equation 


' Born W. Thomson; the additional name was assumed later in connection with 
his knighthood, granted in recognition of his scientific achievements. 
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5Q=TdS, (1.1.2) 


while for irreversible processes, 6Q < T'dS holds. The Second Law is identical 
with the statement that a perpetual motion machine of the second kind is 
impossible to construct (this would be a periodically operating machine which 
performs work by only extracting heat from a single heat bath). 

The atomistic basis of thermodynamics was first recognized in the kinetic 
theory of dilute gases. The velocity distribution derived by Maxwell (1831- 
1879) permits the derivation of the caloric and thermal equation of state of 
ideal gases. Boltzmann (1844-1906) wrote the basic transport equation which 
bears his name in the year 1874. From it, he derived the entropy increase 
(A theorem) on approaching equilibrium. Furthermore, Boltzmann realized 
that the entropy depends on the number of states W(E,V,...) which are 
compatible with the macroscopic values of the energy E, the volume V,... as 
given by the relation 


S x logW(E,V,...) . (1.1.3) 


It is notable that the atomistic foundations of the theory of gases were laid 
at a time when the atomic structure of matter had not yet been demon- 
strated experimentally; it was even regarded with considerable scepticism by 
well-known physicists such as E. Mach (1828-1916), who favored continuum 
theories. 

The description of macroscopic systems in terms of statistical ensembles 
was justified by Boltzmann on the basis of the ergodic hypothesis. Fundamen- 
tal contributions to thermodynamics and to the statistical theory of macro- 
scopic systems were made by J. Gibbs (1839-1903) in the years 1870-1900. 

Only after the formulation of quantum mechanics (1925) did the correct 
theory for the atomic regime become available. To distinguish it from clas- 
sical statistical mechanics, the statistical mechanics based on the quantum 
theory is called quantum statistics. Many phenomena such as the electronic 
properties of solids, superconductivity, superfluidity, or magnetism can be 
explained only by applying quantum statistics. 

Even today, statistical mechanics still belongs among the most active 
areas of theoretical physics: the theory of phase transitions, the theory of 
liquids, disordered solids, polymers, membranes, biological systems, granular 
matter, surfaces, interfaces, the theory of irreversible processes, systems far 
from equilibrium, nonlinear processes, structure formation in open systems, 
biological processes, and at present still magnetism and superconductivity 
are fields of active interest. 

Following these remarks about the problems treated in statistical mechan- 
ics and its historical development, we now indicate some characteristic prob- 
lems which play a role in the theory of macroscopic systems. Conventional 
macroscopic systems such as gases, liquids and solids at room temperature 
consist of 10!9-10?° particles per cm®. The number of quantum-mechanical 
eigenstates naturally increases as the number of particles. As we shall see 


1.1 Introduction 3 


Fig. 1.1. Spacing of the energy levels for a 
large number of particles N. 


later, the separation of the energy levels is of the order of e~%, i.e. the en- 
ergy levels are so densely spaced that even the smallest perturbation can 
transfer the system from one state to another one which has practically the 
same energy. 

Should we now set ourselves the goal of calculating the motion of the 3N 
coordinates in classical physics, or the time dependence of the wavefunctions 
in quantum mechanics, in order to compute temporal averages from them? 
Both programs would be impossible to carry out and are furthermore unnec- 
essary. One can solve neither Newton’s equations nor the Schrodinger equa- 
tion for 101°-10? particles. And even if we had the solutions, we would not 
know all the coordinates and velocities or all the quantum numbers required 
to determine the initial values. Furthermore, the detailed time development 
plays no role for the macroscopic properties which are of interest. In addition, 
even the weakest interaction (external perturbation), which would always be 
present even with the best possible isolation of the system from its environ- 
ment, would lead to a change in the microscopic state without affecting the 
macroscopic properties. For the following discussion, we need to define two 
concepts. 

The microstate: it is defined by the wavefunction of the system in quantum 
mechanics, or by all the coordinates and momenta of the system in classical 
physics. 

The macrostate: this is characterized by a few macroscopic quantities (energy, 
volume, ...). 

From the preceding considerations it follows that the state of a macro- 
scopic system must be described statistically. The fact that the system passes 
through a distribution of microstates during a measurement requires that we 
characterize the macrostate by giving the probabilities for the occurrence of 
particular microstates. The collection of all the microstates which represent 
a macrostate, weighted by their frequency of occurrence, is referred to as a 
statistical ensemble. 

Although the state of a macroscopic system is characterized by a statis- 
tical ensemble, the predictions of macroscopic quantities are precise. Their 
mean values and mean square deviations are both proportional to the num- 
ber of particles N. The relative fluctuations, i.e. the ratio of fluctuations to 
mean values, tend towards zero in the thermodynamic limit (see (1.2.21c)). 
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1.2 A Brief Excursion into Probability Theory 


At this point, we wish to collect a few basic mathematical definitions from 
probability theory, in order to derive the central limit theorem.” 


1.2.1 Probability Density and Characteristic Functions 


We first have to consider the meaning of the concept of a random variable. 
This refers to a quantity X which takes on values x depending upon the 
elements e of a “set of events” E. In each individual observation, the value 
of X is uncertain; instead, one knows only the probability for the occurrence 
of one of the possible results (events) from the set EF. For example, in the 
case of an ideal die, the random variable is the number of spots, which can 
take on values between 1 and 6; each of these events has the probability 1/6. 
If we had precise knowledge of the initial position of the die and the forces 
acting on it during the throw, we could calculate the result from classical me- 
chanics. Lacking such detailed information, we can make only the probability 
statement given above. Let e € E be an event from the set F and P, be its 
corresponding probability; then for a large number of attempts, N, the num- 
ber of times N, that the event e occurs is related to P. by limy—o Ae = Ps, 

Let X be a random variable. If the values x which X can assume are con- 
tinuously distributed, we define the probability density of the random variable 
to be w(x). This means that w(a)dzx is the probability that X assumes a value 
in the interval [x, «+ dz]. The total probability must be one, i.e. w(z) is nor- 
malized to one: 


+00 
i dxw(x) =1. (1.2.1) 


—Co 


Definition 1: The mean value of X is defined by 


+00 
(X) = / dx w(x) x . (1.2.2) 


Now let F(X) be a function of the random variable X; one then calls F(X) 
a random function. Its mean value is defined corresponding to (1.2.2) by? 


(F(X)) = / dx w(x) F(x) . (1.2.2) 


The powers of X have a particular importance: their mean values will be 
used to introduce the moments of the probability density. 


? See e.g.: W. Feller, An Introduction to Probability Theory and its Applications, 
Vol. I (Wiley, New York 1968). 

3 In the case that the limits of integration are not given, the integral is to be taken 
from —co to +00. An analogous simplified notation will also be used for integrals 
over several variables. 
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Definition 2: The nth moment of the probability density w(x) is defined as 
Lin = (X") . (1.2.3) 


(The first moment of w(x) is simply the mean value of X.) 
Definition 8: The mean square deviation (or variance) is defined by 


2 
(Ac)? = (X?) — (x)? = ((X —(X))”). (1.2.4) 
Its square root is called the root-mean-square deviation or standard deviation. 
Definition 4: Finally, we define the characteristic function: 
x(k) = ace w(x) = (eX) | (1.2.5) 


By taking its inverse Fourier transform, w(x) can be expressed in terms of 
x(k): 


uae / “ elk (h) . (1.2.6) 


Under the assumption that all the moments of the probability density w(z) 
exist, it follows from Eq. (1.2.5) that the characteristic function is 


Gelaeee 
x(k) = 5° re ae (1.2.7) 
If X has a discrete spectrum of values, i.e. the values €1, €2,... can occur 
with probabilities p;,p2,..., the probability density has the form 
w(a) = pid(a — €1) + ped(a — a) +... . (1.2.8) 


Often, the probability density will have discrete and continuous regions. 

In the case of multidimensional systems (those with several components) 
X = (Xi, Xo,...), let x = (41, 22,...) be the values taken on by X. Then 
the probability density (also called the joint probability density) is w(x) 
and it has the following significance: w(x)dx = w(x)dx,dzr2...dxy is the 
probability of finding x in the hypercubic element x,x+ dx. We will also use 
the term probability distribution or, for short, simply the distribution. 
Definition 5: The mean value of a function F(X) of the random variables X 
is defined by 


(F(X)) = [ exw0oFes : (1.2.9) 


Theorem: The probability density of F(X) 
A function F of the random variables X is itself a random variable, which 
can take on the values f corresponding to a probability density wr(f). The 
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probability density wr(f) can be calculated from the probability density 
w(x). We assert that: 


wr(f) = (6(F(X) — f))- (1.2.10) 


Proof: We express the probability density wr(f) in terms of its characteristic 
function 


wef) = f Pett SO ey 
If we insert (F”) = [dx w(x) F(x)", we find 


dk , ; 
wr(f) = [ee [axw(xjerr 


and, after making use of the Fourier representation of the 6-function 6(y) = 
f&eé, we finally obtain 


ime / dx w(x)6(f — F(x) = (6(F(X) - f)) , 


i.e. Eq. (1.2.10). 
Definition 6: For multidimensional systems we define correlations 


Big = (Xi — (Xa)) (X35 — (X5))) (1.2.11) 


of the random variables X; and X,;. These indicate to what extent fluctuations 
(deviations from the mean value) of X; and X; are correlated. 
If the probability density has the form 


w(x) = wi(vi)w'({re, k A 4h) , 
where w’({xz~,k A i}) does not depend on #;, then Ky; = 0 for j #1, ie. X; 
and X, are not correlated. In the special case 

w(x) = wi(r1)--- ww (en) , 


the stochastic variables X,,..., Xj are completely uncorrelated. 

Let P,(a1,.-.,;2n—1,%n) be the probability density of the random vari- 
ables X1,...,Xn—1, Xn. Then the probability density for a subset of these 
random variables is given by integration of P,, over the range of values of the 
remaining random variables; e.g. the probability density P,—1(a1,...,%n—1) 
for the random variables X1,..., Xn—1 is 


Pn-1(@1,---)2n—1) = [dvnPalcer sate Dy abe By) 


Finally, we introduce the concept of conditional probability and the condi- 
tional probability density. 
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Definition 7: Let P,(#1,...,@n) be the probability (density). The conditional 
probability (density) 


Pyne 014 aries UE |Le41, cane ,Ln) 


is defined as the probability (density) of the random variables 21,..., 2%, if 


the remaining variables 7,41,...,% have given values. We find 
PlGF he 
Pyin—e(®15 +++ Bk |Lkp1y- ++. 2n) = ea ) ; (1.2.12) 
Prk (@e41; wee , Xn) 
where Pec Geri casey) — [dian Gay, ino en) 


Note concerning conditional probability: formula (1.2.12) is usually introduced as a 
definition in the mathematical literature, but it can be deduced in the following way, 
if one identifies the probabilities with statistical frequencies: P,(x1,...,Uk,2K+1; 

.,&n) for fixed xp+1,...,%n determines the frequencies of the 11,...,2% with 
given values of %,41,..-,%n. The probability density which corresponds to these 
frequencies is therefore proportional to Pn(@1,...,%k,;Uk+1,-.-,Xn). Since 
foday...dxpPn(@1,..., Ue, Crpi,---,2n) = Pa—e(@e41,---,%n), the conditional 
probability density normalized to one is then 


Pil GA,345En) 
Pg (txaiy sae ,Ln) E 


Prine (21, 2 de ,Uk|Le41, ie .; In) = 


1.2.2 The Central Limit Theorem 


Let there be mutually independent random variables X1, X2,..., Xn which 
are characterized by common but independent probability distributions w(21), 
w(a2),...,w(ay). Suppose that the mean value and the variance of X), X2,..., 
Xwy exist. We require the probability density for the sum 


Voor ok ad (1.2.13) 


in the limit N — oo. As we shall see, the probability density for Y is given 
by a Gaussian distribution. 
Examples of applications of this situation are 


a) A system of non-interacting particles 
X;, = energy of the i-th particle, Y = total energy of the system 


b) The random walk 
X;, = distance covered in the i-th step, Y = location after N steps. 


In order to carry out the computation of the probability density of Y in 
a convenient way, it is expedient to introduce the random variable Z: 


Z=S°(Xi - (X))/VN = (Y¥ - N(X))/VN, (1.2.14) 


where (X) = (X)) =... = (Xy) by definition. 
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From (1.2.10), the probability density wz(z) of the random variables Z 
is given by 


wz(z) = [te dey w(r)...wlew) 5(2- oo Raeraaaoals + VN(X)) 


VN 
ar skort tN) 4 R/V 
= f Se f dey...doy w(er)...u(en)e 1+ EN) bik N(X) 
TW 
Qn JN ) Des 


where x(q) is the characteristic function of w(x). 

The representation (1.2.7) of the characteristic function in terms of the 
moments of the probability density can be reformulated by taking the loga- 
rithm of the expansion in moments, 


1 5 


x(q) = exp[-ig(X) — 54 (Ae) cntg? Peels (1.2.16) 
i.e. in general 
x(q) = exp Ds Suk Cr] ; (1.2.16’) 


In contrast to (1.2.7), in (1.2.16’) the logarithm of the characteristic function 
is expanded in a power series. The expansion coefficients C,, which occur in 
this series are called cumulants of the nth order. They can be expressed in 
terms of the moments (1.2.3); the three lowest take on the forms: 


Ci = (X) = 1 
C2 = (Az)” = (X?) — (X)? = pe — pe? (42:17) 
C3 = (X*) — 3(X?)(X) + 2(X)" = ps — 3p + 2u3 
The relations (1.2.17) between the cumulants and the moments can be ob- 
tained by expanding the exponential function in (1.2.16) or in (1.2.16’) and 
comparing the coefficients of the Taylor series with (1.2.7). Inserting (1.2.16) 
into (1.2.15) yields 
wz(z) a et a oe ae (1.2.18) 
7 


From this, neglecting the terms which vanish for large N as 1/N or more 
rapidly, we obtain 


1/2 


wz(z) = (2m(Az)”)~ eo” Waar? (1.2.19) 


and finally, using Wy(y)dy = Wz(z)dz for the probability density of the 
random variables Y,, 


1/2 — (v=(*)N)? 
e 


wy (y) = (2nN(Az)*) a Ae)N (1.2.20) 
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This is the central limit theorem: wy(y) is a Gaussian distribution, although 
we did not in any way assume that w(a) was such a distribution, 
mean value: (Y) = N(X) (1.2.21a) 
standard deviation: Ay = ArJ/N (1.2.21b) 
Ay _ Ar/N Ae 
(VY) N(X)  (X)VN 
The central limit theorem provides the mathematical basis for the fact that 
in the limiting case of large N, predictions about Y become sharp. From 


(1.2.21c), the relative deviation, i.e. the ratio of the standard deviation to 
the mean value, approaches zero in the limit of large N. 


relative deviation: (1.2.21c) 


1.3 Ensembles in Classical Statistics 


Although the correct theory in the atomic regime is based on quantum me- 
chanics, and classical statistics can be derived from quantum statistics, it is 
more intuitive to develop classical statistics from the beginning, in parallel to 
quantum statistics. Later, we shall derive the classical distribution function 
within its range of validity from quantum statistics. 


1.3.1 Phase Space and Distribution Functions 


We consider N particles in three dimensions with coordinates qi,...,q3n and 
momenta pi,...,p3n- Let us define phase space, also called I’ space, as the 
space which is spanned by the 6N coordinates and momenta. A microscopic 
state is represented by a point in the I space and the motion of the overall 
system by a curve in phase space (Fig. 1.2), which is also termed a phase- 
space orbit or phase-space trajectory. 

As an example, we consider the one-dimensional harmonic oscillator 


= qo coswt 
eae (1.3.1) 


p=—-—mMagowsinuwt , 


whose orbit in phase space is shown in Fig. 1.3. 

For large N, the phase space is a space of many dimensions. As a rule, our 
knowledge of such a system is not sufficient to determine its position in phase 
space. As already mentioned in the introductory section 1.1, a macrostate 
characterized by macroscopic values such as that of its energy E, volume V, 
number of particles N etc., can be generated equally well by any one of a 
large number of microstates, i.e. by a large number of points in phase space. 
Instead of singling out just one of these microstates arbitrarily, we consider 
all of them, i.e. an ensemble of systems which all represent one and the same 
macrostate but which contains all of the corresponding possible microstates. 
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p 


Fig. 1.2. A trajectory in phase space. Here, q 
q and p represent the 6N coordinates and mo- 
menta qi,...,qQ3n and pi,...,p3n- 


do 
Fig. 1.3. The phase-space orbit of the 
one-dimensional harmonic oscillator. 


The weight with which a point (q,p) = (q@,---,;43N,P1;---;P3N) Occurs at 
the time t is given by the probability density p(q, p, t). 

The introduction of this probability density is now not at all just an ex- 
pression of our lack of knowledge of the detailed form of the microstates, but 
rather it has the following physical basis: every realistic macroscopic system, 
even with the best insulation from its surroundings, experiences an inter- 
action with its environment. This interaction is to be sure so weak that it 
does not affect the macroscopic properties of the system, i.e. the macrostate 
remains unchanged, but it induces the system to change its microstate again 
and again and thus causes it for example to pass through a distribution of 
microstates during a measurement process. These states, which are occu- 
pied during a short time interval, are collected together in the distribution 
p(q,p). This distribution thus describes not only the statistical properties 
of a fictitious ensemble of many copies of the system considered in its di- 
verse microstates, but also each individual system. Instead of considering the 
sequential stochastic series of these microstates in terms of time-averaged 
values, we can observe the simultaneous time development of the whole en- 
semble. It will be a major task in the following chapter to determine the 
distribution functions which correspond to particular physical situations. To 
this end, knowledge of the equation of motion which we derive in the next 
section will prove to be very important. For large N, we know only the prob- 
ability distribution p(q, p,t). Here, 
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3N 


P(qs Ps t)dqdp = p(n, +++)93N,P1,--- , p3n,t) II dqidp; (1.3.2) 
t=1 


is the probability of finding a system of the ensemble (or the individual sys- 
tems in the course of the observation) at time ¢ within the phase-space vol- 
ume element dqdp in the neighborhood of the point g,p in I’ space. p(q, p, t) 
is called the distribution function. It must be positive, p(q,p,t) > 0, and 
normalizable. Here, g,p stand for the whole of the coordinates and momenta 
G1,+++;93N;P1;---,P3Nn- 


1.3.2 The Liouville Equation 


We now wish to determine the time dependence of p(q, p,t), beginning with 
the initial distribution W(qo, po) at time t = 0 on the basis of the classical 
Hamiltonian H. We shall assume that the system is closed. The following re- 
sults are however also valid when H contains time-dependent external forces. 
We first consider a system whose coordinates in phase space at t = 0 are 
qo and po. The associated trajectory in phase space, which follows from the 
Hamiltonian equations of motion, is denoted by q(t; qo, po), p(t; go, Po), with 
the intitial values of the trajectories given here explicitly. For a single trajec- 
tory, the probability density of the coordinates gq and the momenta p has the 
form 


5(q — a(t; G0, Po)) 5(p — p(t; qo, Po)) - (1.3.3) 


Here, 6(k) = 6(ki)...6(k3n). The initial values are however in general not 
precisely known; instead, there is a distribution of values, W(qo, po). In this 
case, the probability density in phase space at the time ¢ is found by multi- 
plication of (1.3.3) by W(q¢o, po) and integration over the initial values: 


p(q,p,t) = fe fac W (qo; po)6(q—4(t; qo, Po)) 5(p—pl(t; qo, po)) - (1.3.3') 


We wish to derive an equation of motion for p(q,p,t). To this end, we use 
the Hamiltonian equations of motion 


OH OH 
ae ph eae 1.3.4 
a= 3, ? Be: (1.3.4) 
The velocity in phase space 
OH OH 
= (q,p) = | —,-— 1.3.4! 
=@=(F-F) (1.3.4) 


fulfills the equation 


par Ogi | ODi\ _ O°? OPH \ _ 
ase (3 a) » ae Opida) 9 en 
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That is, the motion in phase space can be treated intuitively as the “flow” of 
an incompressible “fluid” . 
Taking the time derivative of (1.3.3’), we find 


Op(q, pt) 
ot 
O.., a) 
=- > | aaoapo¥ (40,0) Gi(ts G0, Po) > + Bilt; G0, Po) a 
i Gi Pi 


x 5(q — alt; qo, Po)) 6(p — p(t; qo, Po)) . (1.3.6) 


Expressing the velocity in phase space in terms of (1.3.4), employing the 
6-functions in (1.3.6), and finally using (1.3.3’) and (1.3.5), we obtain the 
following representations of the equation of motion for p(q, p,t): 


Op _ Be ot 5 Ole 


a 


= Op, | Op. 
De & i+ sei) (1.3.7) 


s 3 Op OH | Op OH 
F Og: OD; : Op; OG: ; 


Making use of the Poisson bracket notation‘, the last line of Eq. (1.3.7) can 
also be written in the form 


Op 
i — {H, p} (1.3.8) 


This is the Liouville equation, the fundamental equation of motion of the 
classical distribution function p(q, p,t). 


Additional remarks: 


We discuss some equivalent representations of the Liouville equation and 
their consequences. 


(i) The first line of the series of equations (1.3.7) can be written in abbre- 
viated form as an equation of continuity 


Op . 
ae div up. (1.3.9) 


One can imagine the motion of the ensemble in phase space to be like the 
flow of a fluid. Then (1.3.9) is the equation of continuity for the density and 
Eq. (1.3.5) shows that the fluid is incompressible. 


4 os du a du a 
{u,v} =, [ge ge — 24 22] 
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(ii) We once more take up the analogy of motion in phase space to fluid 
hydrodynamics: in our previous discussion, we considered the density at a 
fixed point q,p in I’ space. However, we could also consider the motion from 
the point of view of an observer moving with the “flow”, i.e. we could ask 
for the time dependence of p(q(t), p(t), £) (omitting the initial values of the 
coordinates, go and po, for brevity). The second line of Eq. (1.3.7) can also 
be expressed in the form 


< o(a(t),(t),t) — aie (1.3.10) 


Hence, the distribution function is constant along a trajectory in phase space. 
(iii) We now investigate the change of a volume element dI in phase space. 
At t = 0, let a number dN of representatives of the ensemble be uniformly 
distributed within a volume element d/. Owing to the motion in phase space, 
they occupy a volume dI° at the time t. This means that the density p at 
t = 0 is given by $7, while at time t, it is 4. From (1.3.10), the equality of 
these two quantities follows, from which we find (Fig. 1.4) that their volumes 
are the same: 


dl =dIp. (1.3.11) 


Equation (1.3.8) is known in mechanics as the Liouville theorem. There, 
it is calculated from the Jacobian with the aid of the theory of canonical 
transformations. Reversing this process, we can begin with Eq. (1.3.11) and 
derive Eq. (1.3.10) and die Liouville equation (1.3.8). 


q 


Fig. 1.4. The time dependence of an element in phase space; its volume remains 
constant. 


°L.D. Landau and E.M. Lifshitz, Course of Theoretical Physics I: Mechanics, 
Eq. (46.5), Pergamon Press (Oxford, London, Paris 1960) 
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1.4 Quantum Statistics 


1.4.1 The Density Matrix for Pure and Mixed Ensembles® 


The density matrix is of special importance in the formulation of quan- 
tum statistics; it can also be denoted by the terms ‘statistical operator’ and 
‘density operator’. 

Let a system be in the state |w). The observable A in this state has the 
mean value or expectation value 


(A) = (| Aly) - (1.4.1) 


The structure of the mean value makes it convenient to define the density 
matrix by 


p=|¥) | . (1.4.2) 
We then have: 


(A) = Tr(pA) (1.4.3a) 
Trp=1, p? =p, pl=p. (1.4.3b,c,d) 


Here, the definition of the trace (Tr) is 


Tr X = $0 (n|X|n) , (1.4.4) 


where {|n)} is an arbitrary complete orthonormal basis system. Owing to 


Tr X = J+ Yo (nfm) (m| X jn) = 52ST (m| X |n) (nlm) 
=> (m|X|m) , 


the trace is independent of the basis used. 
n.b. Proofs of (1.4.3a-c): 
Tr pA = S7 (nlp) (| Alm) = 57 (| A |r) (nib) = ALY) , 


Tr p=Tr pl = (| 1b) =1, p? = |v) (Hd) (| = |) (| =p. 


If the systems or objects under investigation are all in one and the same 
state |), we speak of a pure ensemble, or else we say that the systems are in 
a pure state. 


® See e.g. F. Schwabl, Quantum Mechanics, 3rd edition, Springer, Heidelberg, 
Berlin, New York 2002 (corrected printing 2005), Chap. 20. In the following, 
this textbook will be abbreviated as ‘QM I’. 


1.4 Quantum Statistics 15 


Along with the statistical character which is inherent to quantum-mechanical 


systems, in addition a statistical distribution of states can be present in an 
ensemble. If an ensemble contains different states, we call it a mired ensem- 
ble, a mixture, or we speak of a mixed state. We assume that the state |y1) 
occurs with the probability p1, the state |q,;) with the probability p;,, etc., 
with 


i 
The mean value or expectation value of A is then 


(A) = Pi (bi| Aldi) - (1.4.5) 


This mean value can also be represented in terms of the density matrix defined 
by 


p= Pi i) (Wil - (1.4.6) 
We find: 
(A) =Tr pA (1.4.7a) 
Trp=1 (1.4.7b) 
p?>#p and Tr p? <1, in the case that p; 4 0 for more than one i 
(1.4.7c) 
pi=p. (1.4.7d) 


The derivations of these relations and further remarks about the density 
matrices of mixed ensembles will be given in Sect. 1.5.2. 


1.4.2 The Von Neumann Equation 


From the Schrodinger equation and its adjoint 
nS Wt) = Ht), ih (WA| = (Wt H 
a Ot >) = 9 ’ a Ot >) => >) >) 
it follows that 
a) ‘ , 
thee = ind Pi (> (bil + |v) (dil) 
= So pi (H li) (Wal — i) (il Z) - 
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From this, we find the von Neumann equation, 


Baas St 


oy = HUH pl: (1.48) 


it is the quantum-mechanical equivalent of the Liouville equation. It describes 
the time dependence of the density matrix in the Schrodinger representation. 
It holds also for a time-dependent H. It should not be confused with the 
equation of motion of Heisenberg operators, which has a positive sign on the 
right-hand side. 

The expectation value of an observable A is given by 


(A), = Tr(p(t) A) , (1.4.9) 


where p(t) is found by solving the von Neumann equation (1.4.8). The time 
dependence of the expectation value is referred to by the index t. 

We shall meet up with the von Neumann equation in the next chapter 
where we set up the equilibrium density matrices, and it is naturally of fun- 
damental importance for all time-dependent processes. 

We now treat the transformation to the Heisenberg representation. The 
formal solution of the Schrédinger equation has the form 


|-b(t)) = U(E, to) [B(to)) (1.4.10) 


where U(t, to) is a unitary operator and |7)(to)) is the initial state at the time 
to. From this we find the time dependence of the density matrix: 


p(t) = U(t, to) p(to)U (t,to)* (1.4.11) 


(For a time-independent H, U(t, to) = e~i#(¢-')/”,) 
The expectation value of an observable A can be computed both in the 
Schrédinger representation and in the Heisenberg representation 


(A), = Tr(p(t)A) = Tr(p(to)U(t, to) AU(t, to)) = Tr(p(to)An(t)) . (1.4.12) 
Here, Ay (t) = U'(t, to) AU(t, to) is the operator in the Heisenberg represen- 


tation. The density matrix p(to) in the Heisenberg representation is time- 
independent. 


*1.5 Additional Remarks 


*1.5.1 The Binomial and the Poisson Distributions 


We now discuss two probability distributions which occur frequently. Let 
us consider an interval of length L which is divided into two subintervals 
(0, a] and [a, L]. We now distribute N distinguishable objects (‘particles’) in 


*1.5 Additional Remarks 17 


a completely random way over the two subintervals, so that the probability 
that a particle be found in the first or the second subinterval is given by + 
or (1 — ¢). The probability that n particles are in the interval [0, a] is then 
given by the binomial distribution™ 


m=('-9""C). ns. 


where the combinatorial factor (*) gives the number of ways of choosing n 
objects from a set of N. The mean value of n is 


N 
(n) = Yn = aN (1.5.2a) 
and its mean square deviation is 
An)? =< (1-2)N. 1.5.2 
(An)" = + Zr (1.5.2b) 


We now consider the limiting case D > a. Initially, w, can be written 


> = NW-1)(N—"+) in the form 


ome (SY 0-8) (BR) 
XC ee y)--(l n—1) 


n! (1— 4)” ; 


where for the mean value (1.5.2a), we have introduced the abbreviation 
7m = “In the limit 4 — 0, N — oo for finite 7, the third factor in (1.5.3a) 
becomes e~” and the last factor becomes equal to one, so that for the prob- 


ability distribution, we find: 


using ( 


(1.5.3a) 


=n 


Wn =—e”. (1.5.3b) 


This is the Poisson distribution, which is shown schematically in Fig. 1.5. 
The Poisson distribution has the following properties: 


Sota = (an)? ae (1.5.4a,b,c) 


n 


The first two relations follow immediately from the derivation of the Poisson 
distribution starting from the binomial distribution. They are obtained in 
problem 1.5 together with 1.5.4c¢ directly from 1.5.3b. The relative deviation 


” A particular arrangement with n particles in the interval a and N—n in L—a, 
e.g. the first particle in a, the second in L — a, the third in L — a, etc., has the 


probability (4)” (1 — Nei 
by the number of combinations, i.e. the binomial coefficient ee }. 


. From this we obtain w, through multiplication 
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Wn 


n Fig. 1.5. The Poisson distribution 
is therefore 
An 1 
ao 17 (1.5.5) 


For numbers 7% which are not too large, e.g. 7 = 100, An = 10 and au = TE 
For macroscopic systems, e.g. 7% = 10°, we have An = 10!° and 42 = 107°, 
The relative deviation becomes extremely small. For large 7, the distribution 
Wr is highly concentrated around 7. The probability that no particles at 
all are within the subsystem, i.e. wo = e~!”, is vanishingly small. The 
number of particles in the subsystem [0, a] is not fixed, but however its relative 
deviation is very small for macroscopic subsystems. 


In the figure below (Fig. 1.6a), the binomial distribution for N = 5 and 


4 — 3 (and thus 7 = 1.5) is shown and compared to the Poisson distribu- 
tion for 7 = 1.5; in b) the same is shown for N = 10, # = # (ie. again 
m = 1.5). Even with these small values of N, the Poisson distribution al- 


ready approximates the binomial distribution rather well. With N = 100, 
the curves representing the binomial and the Poisson distributions would 
overlap completely. 


Wn Wn 
N= 5 N = 10 
0.3 a, 3 03. a.3 
Lr 10 L320 
n= tS n=15 


— Binomial distribution 
— Poisson distribution 


0.1 


— Binomial distribution 
—— Poisson distribution 
0.1 


a) b) 


Fig. 1.6. Comparison of the Poisson distribution and the binomial distribution 
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*1.5.2 Mixed Ensembles and the Density Matrix 
of Subsystems 


(i) Proofs of (1.4.7a-d) 


Tr pA= dD (Wi| A |r) (rida) = DP (p;i| A|wi) = (A) . 


From this, (1.4.7b) also follows using A = 1. 


p= 2 >> pip; Wa) (ils) (Wil 4 0 - 
tj 
For arbitrary |w), the expectation value of p 


(b| pb) = DP (ales)? 


is positive definite. Since p is Hermitian, the eigenvalues P,,, of p are positive 
and real: 


p|m) = Pm |m) 


p= = Pn im) (m| , (1.5.6) 


P20, So Pei Gal SO: 
m=1 


In this basis, p? = >, P?, |m) (m| and, clearly, Trp? = >, P?, < 1, if more 
than only one state occurs. One can also derive (1.4.7c) directly from (1.4.6), 
with the condition that at least two different but not necessarily orthogonal 
states must occur in (1.4.6): 


Tr p? = SOD apy (ils) (Waln) (nls) 


n ig 


= Y= asl (bilby)? <n Y pre. 
i,j : F 


(ii) The criterion for a pure or a mixed state is — from Eq. (1.4.3c) and 
(1.4.7c) — given by Tr p? = 1 or Tr p? <1. 

(iii) We consider now a quantum-mechanical system which consists of two 
subsystems 1 and 2. Their combined state is taken to be 


= So en |1n) [2n) , (1.5.7) 
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where more than one c, differs from zero. The associated density matrix is 
given by 


p= |) (| - (1.5.8) 


We now carry out measurements dealing with only subsystem 1, i.e. the 
operators corresponding to the observables A act only on the states |1n). We 
then find for the expectation value 


(A) = Tri TropA = Tri[(Tr2p) Al . (1.5.9) 


Here, Tr; refers to taking the trace over the subsystem 7. According to 
Eq. (1.5.9), the density matrix which determines the outcome of these ex- 
periments is obtained by averaging p over subsystem 2: 


p= Trop = )— en|* [1n) (In| . (1.5.10) 


This is the density matrix of a mixture, although the overall system is in a 
pure state. 
The most general state of the subsystems 1 and 2 has the form® 


I) = So cam |1n) |2m) . (1.5.11) 


Here, again, we find that 


p= Tro |b) (| = > ps Ss CnmCn im (U7) (In| 
E(Eeni) (Lem) 


is in general a mixture. Since a macroscopic system will have spent some time 
in contact with some other systems, even when it is completely isolated, it 
will never be in a pure state, but always in a mixed state. 

It may be instructive to consider the following special case: we write Cym in 
the form Cam = |Cnm|e’?™. In the case that the phases Ynm are stochastic, 
from fp we then obtain the density matrix 


i= TL ([ a2 )a- S (SX leoml?) dt 


{nm) 


(1.5.12) 


8 As an aside, we note that it is possible to introduce a biorthogonal system 
(Schmidt basis) which brings the state (1.5.11) into the form (1.5.7); see QM I, 
problem 20.5. 
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Problems for Chapter 1 


1.1 Prove the Stirling formula 

ale Vine xre”, 
by starting from N! = Sa dx a e~* and fitting the integrand f(x) = «%e~* up to 
second order to the function g(x) = Ae @-NY/a? f(x) has a sharp maximum at 


ro=N. 


1.2 Determine the probability w(N,m) that in a system of N spins exactly m 
will be found to have the orientation “T” and correspondingly N — m have the 
orientation “|”. There is no external magnetic field and no interaction of the spins 
with one another, so that for each individual spin, the configurations 7 and | are 
equally probable. 
(a) Verify 

N 

3 w(N,m)=1. 

m=0 


(b) Calculate the mean value of m, 


and its standard deviation, ((m”) — (my?) */?. 

The dimensionless magnetization is defined as M = 2m — N; give its mean value 
and its standard deviation. 

(c) Calculate the distribution w(N, M) for large N. Assume that |M/N| < 1. 


1.3 Derive the central limit theorem for w;(x;) instead of for w(a;). 
Note: In the result, you need only replace N(X) by So, (Xi) and N(Az)? by 


Di (Aei)?. 


1.4 The random walk: a particle moves at each step with equal probability by a 
unit distance either to the left or to the right. 

(a) Calculate (Y) and (Y”) exactly after N = N,+N-_ steps, where Y = Ny—N_. 
(b) What result would you obtain from the central limit theorem? 


1.5 Verify the relations (1.5.4a-c) for the Poisson distribution, (1.5.3b): 
— , nan integer >0. 


1.6 The distribution function p(£1,...,£~) has the form 


p= |] P(4). 


Let the mean value and the standard deviation of the individual F;-values be de- 
noted by e and ((AEi)?)/? = A, respectively. 
Compute the mean value and the standard deviation of EF = Nei E;. 


1.7 Sketch the trajectory in phase space of a particle 
(a) which moves with the energy E within a one-dimensional, infinitely high po- 
tential well (particle in a box): 
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V(q) 


—a +a 


(b) which falls from a height h under the influence of gravity, is inelastically re- 
flected by the ground and rises again to a height 9/10h, etc. 


0q? Op 


pare with v of Eq. (1.3.4’) and show that |v| =|V H| and v LV H. 


1.8 The gradient of the Hamiltonian is defined by V H(q,p) = (4 a), Com- 


1.9 An ion source emits ions of energy FE; = 5000+ 1.00 eV from a surface of area 
1mm? into a solid angle of 2; = 1 sterad. The ions are accelerated by electric fields 
to Ez = 10 MeV and focused onto an area of 1cm?. Calculate the opening angle of 
the ion beam on impact using the Liouville theorem. 

Suggestion: Assume that the acceleration acts so rapidly that the different ve- 
locities within the beam do not lead to an additional broadening of the beam, 
ie. dvg = dxi; assume also that the width of the energy distribution remains un- 
changed, dE2 = dE}. 


1.10 (a) Show that Tr(AB) = Tr(BA). 

(b) The operators py are taken to be density matrices, so that they obey the 
conditions (1.4.7b-d), and p, > 0, 5°, pp = 1. Show that >>, py py also obeys these 
conditions. 


1.11 Consider a beam of light which is propagating in the +z direction. An arbi- 
trary pure polarization state can be written as a linear combination 


alt) +l), 


where |7) represents the state which is polarized in the x direction and ||) the state 
polarized in the y direction. 

(a) Calculate the density matrix: (i) for an arbitrary pure state, (ii) for the state 
polarized in the x direction, (iii) for the state polarized at 45°, and (iv) for the state 
polarized at 135°. 

(b) What is the density matrix like for a mixed state, where e.g. 50% of the light is 
polarized along 45° and 50% along 135°, or 50% is polarized in the x direction and 
50% in the y direction? The angles are those between the x axis and the direction 
of polarization. 


1.12 A Galton board is a board with nails which is set upright; it has N horizontal 
rows of nails, all the same length, shifted so that the nails of successive rows are 
precisely in between those in the row above. In the center above the uppermost row 
is a funnel through which little balls can be released to fall down through the rows 
of nails. Below the bottom row are a series of compartments which catch the balls. 
What curve is represented by the height of the balls in the various compartments? 


Problems for Chapter 1 23 


1.13 A container of volume V holds N particles. Consider the subvolume v and 
assume that the probability of finding a particular particle in this subvolume is 
given by v/V. 

(a) Give the probability p, of finding n particles within v. 

(b) Calculate the mean value 7 and the mean square deviation (n — 7)?. 

(c) Show with the help of the Stirling formula that p, corresponds approximately 
to a Gaussian distribution when N and n are large. 

(d) Show in the limit — — 0 and V — oo with x = const. that pn approaches a 
Poisson distribution. 


1.14 The Gaussian distribution: The Gaussian distribution is defined by the con- 
tinuous probability density 


1 en (t# 0)? /207 


V2ra? 


For this distribution, compute < X >, Ar, < X*+>, and < X— < X° >>. 


we(x) = 


1.15 The log-normal distribution: Let the statistical variables X have the property 
that log X obeys a Gaussian distribution with < log X >= log zo. 

(a) Show by transforming the Gaussian distribution that the probability density 
for X has the form 


1 1 _ Ges(#/e9))? 
P(x) = —===s—e 22 OSE <OO. 


V2no? & 
(b) Show that 
<X >= aoe? /2 
and 
< log X >= log zo. 


(c) Show that the log-normal distribution can be rewritten in the form 


P(x) = ——= (2/x)7 
(=) = Fas (2/0) 
with 
1 x 
H(z) q2 8 Zi 


it can thus be easily confused with a power law when analyzing data. 
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2.1 Introductory Remarks 


As emphasized in the Introduction, a macroscopic system consists of 10!9 — 
103 particles and correspondingly has an energy spectrum with spacings 
of AE ~ e-N. The attempt to find a detailed solution to the microscopic 
equations of motion of such a system is hopeless; furthermore, the required 
initial conditions or quantum numbers cannot even be specified. Fortunately, 
knowledge of the time development of such a microstate is also superfluous, 
since in each observation of the system (both of macroscopic quantities and 
of microscopic properties, e.g. the density correlation function, particle diffu- 
sion, etc.), one averages over a finite time interval. No system can be strictly 
isolated from its environment, and as a result it will undergo transitions into 
many different microstates during the measurement process. Figure 2.1 il- 
lustrates schematically how the system moves between various phase-space 
trajectories. Thus, a many-body system cannot be characterized by a sin- 
gle microstate, but rather by an ensemble of microstates. This statistical 
ensemble of microstates represents the macrostate which is specified by the 
macroscopic state variables E,V, N,...1 (see Fig. 2.1). 


pa 


. Fig. 2.1. A trajectory in phase space 
q (schematic) 


1 A different justification of the statistical description is based on the ergodic 
theorem: nearly every microstate approaches arbitrarily closely to all the states 
of the corresponding ensemble in the course of time. This led Boltzmann to 
postulate that the time average for an isolated system is equal to the average 
over the states in the microcanonical ensemble (see Sect. 10.5.2). 


26 2. Equilibrium Ensembles 


Experience shows that every macroscopic system tends with the passage 
of time towards an equilibrium state, in which 


a 


7 lH, ol (2.1.1) 


must hold. Since, according to Eq. (2.1.1), in equilibrium the density matrix 
p commutes with the Hamiltonian H, it follows that in an equilibrium en- 
semble p can depend only on the conserved quantities. (The system changes 
its microscopic state continually even in equilibrium, but the distribution of 
microstates within the ensemble becomes time-independent.) Classically, the 
right-hand side of (2.1.1) is to be replaced by the Poisson bracket. 


2.2 Microcanonical Ensembles 


2.2.1 Microcanonical Distribution Functions and Density Matrices 


We consider an isolated system with a fixed number of particles, a fixed 
volume V, and an energy lying within the interval [E, FE + A] with a small 
A, whose Hamiltonian is H(q,p) (Fig. 2.2). Its total momentum and total 
angular momentum may be taken to be zero. 


energy hypersurfaces 


Fig. 2.2. Energy shell in phase space 


We now wish to find the distribution function (density matrix) for this 
physical situation. It is clear from the outset that only those points in phase 
space which lie between the two hypersurfaces H(q,p) = E and H(q,p) = 
E+Acan have a finite statistical weight. The region of phase space between 
the hypersurfaces H(q,p) = FE and H(q,p) = E+ A is called the energy 
shell. It is intuitively plausible that in equilibrium, no particular region of the 
energy shell should play a special role, i.e. that all points within the energy 
shell should have the same statistical weight. We can indeed derive this fact 
by making use of the conclusion following (2.1.1). If regions within the energy 
shell had different statistical weights, then the distribution function (density 
matrix) would depend on other quantities besides H(q,p), and p would not 
commute with H (classically, the Poisson bracket would not vanish). Since 


2.2 Microcanonical Ensembles 27 


for a given £,A,V, and N, the equilibrium distribution function depends 
only upon H(q,p), it follows that every state within the energy shell, i-e. all 
of the points in I’ space with E < H(q,p) < E+ A, are equally probable. 
An ensemble with these properties is called a microcanonical ensemble. The 
associated microcanonical distribution function can be postulated to have the 
form 


duc = (2.2.1) 


TEA BS A(g,p) Ss B+ A 
0 otherwise , 

where, as postulated, the normalization constant 2(E£) depends only on E, 

but not on gq and p. 2(E)A is the volume of the energy shell.” In the limit 

A — 0, (2.2.1) becomes 


Puc = aH O(E =~ H(q,p)) : (2.2.1') 


The normalization of the probability density determines (2 (£): 


dq dp 
=n Puc =1. (2.2.2) 


The mean value of a quantity A is given by 
dq dp 
(A) = i 7BN NI PucA. (2.2.3) 


The choice of the fundamental integration variables (whether q or g/const) is 
arbitrary at the present stage of our considerations and was made in (2.2.2) 
and (2.2.3) by reference to the limit which is found from quantum statistics. If 
the factor (h3?“ N!)~1 were not present in the normalization condition (2.2.2) 
and in the mean value (2.2.3), then py;¢ would be replaced by (h?% N!)~! pyre. 
All mean values would remain unchanged in this case; the difference however 
would appear in the entropy (Sect. 2.3). The factor 1/N! results from the 
indistinguishability of the particles. The necessity of including the factor 1/N! 
was discovered by Gibbs even before the development of quantum mechanics. 
Without this factor, an entropy of mixing of identical gases would erroneously 
appear (Gibbs’ paradox). That is, the sum of the entropies of two identical 
ideal gases each consisting of N particles, 25;, would be smaller than the 
entropy of one gas consisting of 2N particles. Mixing of ideal gases will be 
treated in Chap. 3, Sect. 3.6.3.4. We also refer to the calculation of the entropy 
of mixtures of ideal gases in Chap. 5 and the last paragraph of Appendix B.1. 


? The surface area of the energy shell Q(E) depends not only on the energy E 
but also on the spatial volume V and the number of particles N. For our present 
considerations, only its dependence on FE is of interest; therefore, for clarity and 
brevity, we omit the other variables. We use a similar abbreviated notation for the 
partition functions which will be introduced in later sections, also. The complete 
dependences are collected in Table 2.1. 
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For the 6N-dimensional volume element in phase space, we will also use 
the abbreviated notation 


_ dqdp 
dl’ = FanNI - 


From the normalization condition, (2.2.2), and the limiting form given in 
(2.2.1), it follows that 


dq dp 
2(£) = / 73N NI 6(E— H(q,p)) (2.2.4) 
After introducing coordinates on the energy shell and an integration variable 
along the normal k, , (2.2.4) can also be given in terms of the surface integral: 


dS 
E 


ft cas 1 
J WBNN' |VH(q,p)| 


(2.2.4') 


Here, dS is the differential element of surface area in the (6N —1)-dimensional 
hypersurface at energy EF, and V is the 6N-dimensional gradient in phase 
space. In Eq. (2.2.4) we have used H(S'z) = E and performed the integration 
over k,. According to Eq. (1.3.4’), it holds that |W H(q,p)| = |v| and the 
velocity in phase space is perpendicular to the gradient, i.e. v L V H(q,p). 
This implies that the velocity is always tangential to the surface of the energy 
shell; cf. problem 1.8. 


Notes: 


(i) Alternatively, the expression (2.2.4’) can be readily proven by starting with 
an energy shell of finite width A and dividing it into segments dSAk,. Here, dS 
is a surface element and Ak, is the perpendicular distance between the two hy- 
persurfaces (Fig. 2.3). Since the gradient yields the variation perpendicular to an 
equipotential surface, we find |V H(q,p)|Ak. = A, where V H(q,p) is to be com- 
puted on the hypersurface H(q,p) = E. 


Ak dS 


Fig. 2.3. Calculation of the vol- 
ume of the energy shell 
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From this it follows that 


dS 
B)A= | orn hs = | PNNIVA Gp 


i.e. we again obtain (2.2.4’). 


(ii) Equation (2.2.4’) has an intuitively very clear significance. 2 (F) is given by the 
sum of the surface elements, each divided by the velocity in phase space. Regions 
with high velocity thus contribute less to 2 (E). In view of the ergodic hypothesis 
(see 10.5.2), this result is very plausible. See problem 1.8: |v| = |W H| and v L VA. 


As already mentioned, 2 (E)A is the volume of the energy shell in classical 
statistical mechanics. We will occasionally also refer to 2 (E) as the “phase 
surface”. We also define the volume inside the energy shell: 


= dq dp 
2(E) = ‘] 73N NI O(E—H(q,p)) - (2.2.5) 
Clearly, the following relation holds: 
dQ(E) 
2(£) = — (2.2.6) 


Quantum mechanically, the definition of the microcanonical ensemble for an 
isolated system with the Hamiltonian H and associated energy eigenvalues 
Ey, is: 


pac = >) P(En) |r) (n|_, (2.2.7) 


where, analogously to (2.2.1), 


E<E,<E+A 


1 
E,) =< &(4 a ee 2.2.8 
P(En) i‘ otherwise . ( ) 


In the microcanonical density matriz prc, all the energy eigenstates |n) whose 
energy E,, lies in the interval [F, E + A] contribute with equal weights. The 
normalization 


Tr pac = 1 (2.2.9a) 


yields 


E)= =i (2.2.9b) 


where the summation is restricted to energy eigenstates within the energy 
shell. Thus 2(£)A is equal to the number of energy eigenstates within the 
energy shell [E, E + Al]. For the density matrix of the microcanonical ensem- 
ble, an abbreviated notation is also used: 


puc = 2(E)16(H — E) (2:2:7) 
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and 
Q (EF) =Tr 6(H- E). (2.2.9b’) 


Equation (2.2.8) (and its classical analogue (2.2.1)) represent the fundamental 
hypothesis of equilibrium statistical mechanics. All the equilibrium properties 
of matter (whether isolated or in contact with its surroundings) can be de- 
duced from them. The microcanonical density matrix describes an isolated 
system with given values of E,V, and N. The equilibrium density matrices 
corresponding to other typical physical situations, such as those of the canon- 
ical and the grand canonical ensembles, can be derived from it. As we shall 
see in the following examples, in fact essentially the whole volume within the 
hypersurface H(q,p) = E lies at its surface. More precisely, comparison of 
Q(B) and 2(E)A shows that 


log(2(B)A) = log 2(B) + O(log =) . 


Since log Q(E)A and log 2Q(E) are both proportional to N, the remaining 
terms can be neglected for large N; in this spirit, we can write 


N(E)A= Q(B). 


2.2.2 The Classical Ideal Gas 


In this and in the next section, we present three simple examples for which 
§2(E) can be calculated, and from which we can directly read off the char- 
acteristic dependences on the energy and the particle number. We shall now 
investigate the classical ideal gas, i.e. a classical system of N atoms between 
which there are no interactions at all; and we shall see from it how 2 (FE) 
depends on the energy F and on the particle number N. Furthermore, we 
will make use of the results of this section later to derive the thermodynamics 
of the ideal gas. The Hamiltonian of the three-dimensional ideal gas is 
Nea 
H=S —++4+Voaui - (2.2.10) 
é m 


Here, the p; are the cartesian momenta of the particles and Vyay is the 
potential representing the wall of the container. The surface area of the energy 
shell is in this case 


‘ 2 
2(B)= ag [in fatew fat... [dry s(e- > Pr), 
Vv Vv 


t=1 


(2.2.11) 
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where the integrations over x are restricted to the spatial volume V defined 
by the walls. It would be straightforward to calculate 2 (E) directly. We shall 
carry out this calculation here via (E), the volume inside the energy shell, 
which in this case is a hypersphere, in order to have both quantities available: 


= 1 
P(E) = sarG 


ee af OR tte Lf é py OE ~ 2 Pi/am) . (2.2.12) 


Introducing the surface area of the d-dimensional unit sphere,? 


7/2 
P(d/2) ” 


we find, representing the momenta in spherical polar coordinates, 


(Qr)¢’Ky= [an = (2;2.13) 


V2mME 


Q(E) = Lae dQ dp p>\~1 
= 73N NI 3N Pp 
0 


From this, we immediately obtain 
: (2.2.14) 


where I" (32) = (3x — 1)! was used, under the assumption — without loss of 
generality — of an even number of particles. For large N, Eq. (2.2.14) can be 
simplified by applying the Stirling formula (see problem 1.1). 


N!~ NNe-N(2nN)1/? , (2.2.15) 


whereby it suffices to retain only the first two factors, which dominate the 
expression. Then 
N 3N 
5 V AnmE \ ? 
ees (x) (Fr) z 
Making use of Eq. (2.2.6), we obtain from (2.2.14) and (2.2.16) the exact 
result for Q(E): 


or 
we 


(2.2.16) 


SNA, 
VN Qnm (2rmE) 2 


h3N NI(3% — 1)! 


Q(E)= (2247) 


as well as an asymptotic expression which is valid in the limit of large N: 


3 The derivation of (2.2.13) will be given at the end of this section. 
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N 3N 
V 4nmE\ 2? sw 1 3N 
Q(E) = (= —— 2, 2.2.18 

©) (+) (Far) °" EB 2 ea 
In (2.2.16) and (2.2.18), the specific volume V/N and the specific energy 
E/N occur to the power N. We now compare 92(E), the volume inside the 
energy shell, with 2 (£)A, the volume of a spherical shell of thickness A, by 


considering the logarithms of these two quantities (due to the occurrence of 
the Nth powers): 


log (2 (E)A) = log 2(B) + O(log (2.2.19) 


- 
NA/~ 
Since log Q(E)A and log 2Q(E) are both proportional to N, the remaining 


terms can be neglected in the case that N is large. In this approximation, we 
find 


Q(B)A x AE), (2.2.20) 


i.e. nearly the whole volume of the hypersphere H(q,p) < E lies at its surface. 
This fact is due to the high dimensionality of the phase space, and it is to be 
expected that (2.2.20) remains valid even for systems with interactions. 


We now prove the expression (2.2.13) for the surface area of the d-dimensional 
unit sphere. To this end, we compute the d-dimensional Gaussian integral 


I= fan. f ava eT Pit tPA) = (Sm), (2.2.21) 


This integral can also be written in spherical polar coordinates:* 


I =4 app" f dae” = 5 fatette fara = sr(5) fac, (2.2.22) 
0 


P(z)= pare (2.2.23) 
0 


is the gamma function. Comparison of the two expressions (2.2.21) and (2.2.22) 
yields 


d/2 
[2% = ee, (2.2.13’) 


In order to gain further insights into how the volume of the energy shell de- 
pends upon the parameters of the microcanonical ensemble, we will calculate 


4 We denote an element of surface area on the d-dimensional unit sphere by dQQa. 
For the calculation of the surface integral [ dQq, it is not necessary to use the 
detailed expression for d(QQg. The latter may be found in E. Madelung, Die Math- 
ematischen Hilfsmittel des Physikers, Springer, Berlin, 7th edition (1964), p. 244. 
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92 (E) for two other simple examples, this time quantum-mechanical systems; 
these are: (i) harmonic oscillators which are not coupled, and (ii) paramag- 
netic (not coupled) spins. Simple problems of this type can be solved for 
all ensembles with a variety of methods. Instead of the usual combinatorial 
method, we employ purely analytical techniques for the two examples which 
follow. 


*2.2.3 Quantum-mechanical Harmonic Oscillators 
and Spin Systems 


*2.2.3.1 Quantum-mechanical Harmonic Oscillators 


We consider a system of N identical harmonic oscillators, which are either not 
coupled to each other at all, or else are so weakly coupled that their interactions 
may be neglected. Then the Hamiltonian for the system is given by: 


N 
i 
= t 
ic ae (ala, - 5) , (2.2.24) 


where al (a;) are creation (annihilation) operators for the jth oscillator. Thus we 
have 


Q (EF) = ped 6(H pal nj + 3)) 


ev ikhw/2 
/< eit ( E-N); hw(nj+5 oo /< alee 
l-—e- ikhw ? 


(2.2.25) 


and finally 


Q(B) = f en tnen- log(2i sin(khw/2))) _ (2.2.26) 
us 


The computation of this integral can be carried out for large N using the saddle- 
point method.° The function 


f(k) = ike — log (2isin(khw/2)) (2.2.27) 


with e = E/N has a maximum at the point 


(2.2.28) 


This maximum can be determined by setting the first derivative of (2.2.27) equal 
to zero 


°N.G. de Bruijn, Asymptotic Methods in Analysis, (North Holland, 1970); 
P.M. Morse and H. Feshbach, Methods of Theoretical Physics, p. 434, (McGraw 
Hill, New York, 1953). 
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f' (ko) = ie me cot role =0. 


Therefore, with 


f(ko) = ikoe — log (2i/V1 = (2e/hus)?) 


" 2.2.29) 
e e+ i hw hw 2 ( 
i os + Sloe ( (e+ 5) (e- ) / (wy?) 


2 


and f” (ko) = (22)? / sin? (kohw/2), we find for 2 (E): 


Q(B) = = oN itt) Papeh a ey (2.2.30) 


The integral in this expression yields only a factor proportional to VN; thus, the 
number of states is given by 


e+ thw e+ thw e— thw e— thw 
Q(E) = N 2 | 2 a | 2 . (2.2.31 
©) exp { fw fw fw iw |} Gas) 


*2.2.3.2 Two-level Systems: the Spin- Paramagnet 


As our third example, we consider a system of N particles which can occupy one 
of two states. The most important physical realization of such a system is a para- 
magnet in a magnetic field H (h = —peH), which has the Hamiltonian® 


N 
KH=-hY oi, with of =+1. (2.2.32) 


i=1 


The number of states of energy EF is, from (2.2.1), given by 


N 
Q (EF) = >. s(E+hyra)= [SF S~ eik(B+h Dy; 0%) 
4=1 


{o;=41} {o;=+1} (2.2.33) 
= / a e* (2coskh)™ = 2% / a ef *) 
with 
f(k) =ikE + Nlogcoskh. (2.2.34) 


The computation of the integral can again be accomplished by applying the saddle- 
point method. Using f’(k) = iE—Nhtan kh and f”(k) = —Nh?/ cos? kh, we obtain 


® In the literature of magnetism, it is usual to denote the magnetic field by H or 
H. To distinguish it from the Hamiltonian in the case of magnetic phenomena, 
we use the symbol H for the latter. 
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from the condition f’(ko) = 0 


= iF i 1+E/Nh 
koh = arctan Wh 3 log 1— B/N. 


For the second derivative, we find 
f’ (ko) =—(1—(E/Nh)?)Nh? <0 for —Nh<E<Nh. 
Thus, using the abbreviation e = E'/Nh, we have 


Q(E) = Qn exp (-48 log bane + N log u ) dk on 3 (-F"" (ko) (Ko)? 
ae = 


oN Ne, 1l+e N 1 1 acces 
a ] ] ] 1 Nh 
rE ( 2 ine o Sia a eee) 
1 l+te wN l—e 
= 1+e)l 1 ] 
Fee OP — FOL + los SF - FA — eos 
! log(1 — e”) log nn} F 
2(E) = exp {-2 [1 + e)tog FE clog +] o(1,0e.x)} : 


(2.2.35) 


We have now calculated the number of states (2 (E) for three examples. The physical 
consequences of the characteristic energy dependences will be discussed after we 
have introduced additional concepts such as those of entropy and temperature. 


2.3 Entropy 


2.3.1 General Definition 
Let an arbitrary density matrix p be given; then the entropy S is defined by 
S = —kTr (plog p) = —k(log p) . (2.3.1) 


Here, we give the formulas only in their quantum-mechanical form, as we 
shall often do in this book. For classical statistics, the trace operation Tr is 
to be read as an integration over phase space. The physical meaning of S 
will become clear in the following sections. At this point, we can consider 
the entropy to be a measure of the size of the accessible part of phase space, 
and thus also of the uncertainty of the microscopic state of the system: the 
more states that occur in the density matrix, the greater the entropy S. For 
example, for M states which occur with equal probabilities va the entropy 
is given by 


eles 32 ail 
S=—k) yj los gz = blogM . 
1 
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For a pure state, M = 1 and the entropy is therefore S = 0. In the diagonal 
representation of p (Eq. 1.4.8), one can immediately see that the entropy is 
positive semidefinite: 


S=—-k)~ Pylog Py > 0 (2.3.2) 


since zlogx < 0 in the interval 0 < x < 1 (see Fig. 2.4). The factor & in 
(2.3.1) is at this stage completely arbitrary. Only later, by identifying the 
temperature scale with the absolute temperature, do we find that it is then 
given by the Boltzmann constant k = 1.38 x 107! erg/K = 1.38 x 10~73J/K. 
See Sect. 3.4. The value of the Boltzmann constant was determined by Planck 
in 1900. 


The entropy is also a measure of the disorder and of the lack of information content 
in the density matrix. The more states contained in the density matrix, the smaller 
the weight of each individual state, and the less information about the system one 
has. Lower entropy means a higher information content. If for example a volume 
V is available, but the particles remain within a subvolume, then the entropy is 
smaller than if they occupied the whole of V. Correspondingly, the information 
content ( « Tr plogp) of the density matrix is greater, since one knows that the 
particles are not anywhere within V, but rather only in the subvolume. 


2.3.2 An Extremal Property of the Entropy 


Let two density matrices, p and pi, be given. The important inequality 


Tr (p(log pi — log p)) <0. (2.3.3) 


then holds. To prove (2.3.3), we use the diagonal representations of p = 


T,, Pa in} (n| and pr = 3, Pu, V) (ul 
Tr (p(log pi — log p)) = S> Ph (n| (log pi — log Py) |n) = 
= 5 Palo Bem) = Pala) og "(ol = 
<UUPA (n|v) oi a) on ) (vln) = Pan inl (E -1) |n)= 


en ee 


In an intermediate step, we used the basis |v) of p, as well as the inequality 
log x < «—1. This inequality is clear from Fig. 2.4. Formally, it follows from 
properties of the function f(r) = logx —a +1: 


fa=0, f'ay=0, f"(a)= -4 <0 (ie. f(z) is convex). 
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x-1 


Fig. 2.4. Illustrating the inequality loga < 
a—-1 


2.3.3 Entropy of the Microcanonical Ensemble 
For the entropy of the microcanonical ensemble, we obtain by referring to 
(2.3.1) and (2.2.7) 


1 
Smo = —kTr (pace 10g purc) = —kTr (pare log TEA) 


and, since the density matrix is normalized to 1, Eq. (2.2.9a), the final result: 
Suc = klog(Q(E)A) . (2.3.4) 


The entropy is thus proportional to the logarithm of the accessible phase 
space volume, or, quantum mechanically, to the logarithm of the number of 
accessible states. 

We shall now demonstrate an interesting extremal property of the entropy. 
Of all the ensembles whose energy lies in the interval [F, F + A], the entropy 
of the microcanonical ensemble is greatest. To prove this statement, we set 
P1 = Puc in (2.3.3) and use the fact that p, like pyc, differs from zero only 
on the energy shell 


1 
Slo] < —k Tr (p log pc) = —kTr C log aa) = Suc - (2.3.5) 
Thus, we have demonstrated that the entropy is maximal for the microcanon- 
ical ensemble. We note also that for large N, the following representations of 
the entropy are all equivalent: 


Suc = klog Q(E)A = klog Q(E)E = klog Q(E) . (2.3.6) 


This follows from the neglect of logarithmic terms in (2.2.19) and an analo- 
gous relation for 2 (E)E. 

We can now estimate the density of states. The spacing AF of the energy 
levels is given by 


AE SS A . e7 SMc/k ~N A . e NX Fe (2.3.7) 


38 2. Equilibrium Ensembles 


The levels indeed lie enormously close together, i.e. at a high density, as 
already presumed in the Introduction. For this estimate, we used 


S=klogQ(L)A« N; 


this can be seen from the classical results, (2.2.18) as well as (2.2.31) and 
(2.2.35). 


2.4 Temperature and Pressure 


The results for the microcanonical ensemble obtained thus far permit us to 
calculate the mean values of arbitrary operators. These mean values depend 
on the natural parameters of the microcanonical ensemble, E,V, and N. 
The temperature and pressure have so far not made an appearance. In this 
section, we want to define these quantities in terms of the energy and volume 
derivatives of the entropy. 


2.4.1 Systems in Contact: the Energy Distribution Function, 
Definition of the Temperature 


We now consider the following physical situation: let a system be divided 
into two subsystems, which interact with each other, i.e. exchange of energy 
between the two subsystems is possible. The overall system is isolated. The 
division into two subsystems 1 and 2 is not necessarily spatial. Let the Hamil- 
tonian of the system be H = H, + H2+ W. Let further the interaction W 
be small in comparison to H; and H2. For example, in the case of a spatial 
separation, the surface energy can be supposed to be small compared to the 
volume energy. The interaction is of fundamental importance, in that it al- 
lows the two subsystems to exchange energy. Let the overall system have the 
energy E, so that it is described by a microcanonical density matrix: 


Puc = %1,2(E)~*6(Hi+Ho+W —E) = 2) 2(E)~16(H, + He—E) . (2.4.1) 


Here, W was neglected relative to H; and Hz, and §2;,2 (E£) is the phase-space 
surface of the overall system with a dividing wall (see remarks at the end of 
this section). 


\ 


Fig. 2.5. An isolated system divided into subsys- 
tems 1 and 2 separated by a fixed diathermal wall 
(which permits the exchange of thermal energy) 


SSN 
\B\ 
N= 
VE 
\N 
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w (FE) denotes the probability density for subsystem 1 to have the energy 
E,. According to Eq. (1.2.10), w (£1) is given by 


w (£1) = (6(Ai — £1)) 
= ae 19(E)~*6(My + He — E)é(Mi — E1) 


_ 2o(B = By)(E1) 
%,3(E) 


(2.4.2a) 


Here, (2.4.1) was used and we have introduced the phase-space surfaces of 
subsystem 1, 2;(£,) = f dI, 6(H, — E;), and subsystem 2, 22(E — E\) = 
J dI2 6(Hz — E+ E;). The most probable value of E,, denoted as E,, can be 
found from wey) = 0: 


=0. 


1 


(-%(E — E,)Q\(E,) + 22(£ — Ey) 2%(E1)) 


Using formula (2.3.4) for the microcanonical entropy, we obtain 


) 0 
a S(Ee)| 4g = 5 SUE 2.4.3 
DBs 2(Ba)) DB, (Ei) (2.4.3) 
We now introduce the following definition of the temperature: 
s) 
T= —S(E). 2.4.4 
a 5(B) (2.4.4) 
Then it follows from (2.4.3) that 
Ty =T2. (2.4.5) 


In the most probable configuration, the temperatures of the two subsystems 
are equal. We are already using partial derivatives here, since later, several 
variables will occur. For the ideal gas, we can see immediately that the tem- 
perature increases proportionally to the energy per particle, T x E/N. This 
property, as well as (2.4.5), the equality of the temperatures of two systems 
which are in contact and in equilibrium, correspond to the usual concept of 
temperature. 


Remarks: 


The Hamiltonian has a lower bound and possesses a finite smallest eigenvalue 
Eo. In general, the Hamiltonian does not have an upper bound, and the 
density of the energy eigenvalues increases with increasing energy. As a result, 
the temperature cannot in general be negative, (T > 0), and it increases with 
increasing energy. For spin systems there is also an upper limit to the energy. 
The density of states then again decreases as the upper limit is approached, 
so that in this energy range, 2’/Q < 0 holds. Thus in such systems there 
can be states with a negative absolute temperature (see Sect. 6.7.2). Due to 
the various possibilities for representing the entropy as given in (2.3.6), the 


temperature can also be written as T = (k-4 log QE)". 
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Notes concerning §21,2(£) in Eq. (2.4.1); may be skipped over in a first reading: 


(i) In (2.4.1 and 2.4.2a), it must be taken into account that subsystems 1 and 2 
are separated from each other. The normalization factor 921,2(£) which occurs in 
(2.4.1) and (2.4.2a) is not given by 


dq dp 
pars E)= / ABN NI O(H — E)=Q(E), 
but instead by 


dq dpi dq2 dp2 
Ny!h3N1 No!h3N2 


21,2(E) = fariars 6(H E) = O(H E) 


= fas f arvars 5(H — Bon ~ F,) 
(2.4.2b) 
— [ass [ arar, 6(H2 —E+ E,)6(Ay — E;) 


= far, 22) (1) 22(E — Fy). 


(ii) Quantum mechanically, one obtains the same result for (2.4.2a): 


1 
21,2(E) 


w (Ei) = (8(H — Ex) = Tr ( 6(Hi + Ha ~ E)8(Hy ~ E1)) 


ie ities (aay (E ~ B,))6(Hh ~ E1)) 
_ Os(E1)O2(B — Es) 
Qr9(E) 


and 
21,2(E) = 1 6(Ay + Hs — E) = paw Tr (6( Ai + Ho — E)6(Ay — E,)) 
= aes Tr (6(H2 — E+ E1)5(Hi — E1)) = aes 1 (1) Q2(EB — Ey) . 


Here, we have used the fact that for the non-overlapping subsystems 1 and 2, the 
traces Tr; and Tr 2 taken over parts 1 and 2 are independent, and the states must 
be symmetrized (or antisymmetrized) only within the subsystems. 


(iii) We recall that for quantum-mechanical particles which are in non-overlapping 
states (wavefunctions), the symmetrization (or antisymmetrization) has no effect 
on expectation values, and that therefore, in this situation, the symmetrization 
does not need to be carried out at all.” More precisely: if one considers the matrix 
elements of operators which act only on subsystem 1, their values are the same 
independently of whether one takes the existence of subsystem 2 into account, or 
bases the calculation on the (anti-)symmetrized state of the overall system. 


” See e.g. G. Baym, Lectures on Quantum Mechanics (W.A. Benjamin, New York, 
Amsterdam 1969), p. 393 
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2.4.2 On the Widths of the Distribution Functions of Macroscopic 
Quantities 


2.4.2.1 The Ideal Gas 


For the ideal gas, from (2.2.18) one finds the following expression for the 
probability density of the energy FE, Eq. (2.4.2a): 


w (Ey) « (E1/N1)2%1/? (By /N2)3%2/2 . (2.4.6) 


In equilibrium, from the equality of the temperatures [Eq. (2.4.3)], ie. from 


one = oa , we obtain the condition om = =< =, and thus 
= Ni 
EL, = E——_.. 2.4.7 
+ "Ni + Np oe 


If we expand the distribution function w (£,) around the most probable en- 


ergy value £1, using fe 


quadratic term, we find 


&, — 0 and terminating the expansion after the 
1 


~ 1 3 N. 3 N. 
log w (Fy) = logw(F) 4 ( : - 


~ \2 
2\" 2 Re aes) (Bi an 


and therefore 


x —3 Ni+N2 (pF B12 ~ 23 _N (Bi =F)? 
w(E1) = w(Ey)e 4 fee (MEY _ (Bye imme BY (9 4.8) 
where Be + @ - him + hm ~ Ris and é = E/N were used. Here, 


log w (F) rather than w (£1) was expanded, because of the occurrence of the 
powers of the particle numbers N, and No in Eq. (2.4.6). This is also prefer- 
able since it permits the coefficients of the Taylor expansion to be expressed 
in terms of derivatives of the entropy. From (2.4.8), we obtain the relative 
mean square deviation: 


(Can. £9 iB SAN 


FE? ~ B23(Ni+N2) 3N MN 


10-79 (2.4.9) 


and the relative width of the distribution, with Nj ~ Nj, 


BE Be (2.4.10) 
Ey VN 
For macroscopic systems, the distribution is very sharp. The most prob- 
able state occurs with a stupendously high probability. The sharpness of the 
distribution function becomes even more apparent if one expresses it in terms 
of the energy per particle, e; = £1 /N1, including the normalization factor: 


[3 NN 2% (a)? 
We, (e1) = es ue ree) ‘ 
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2.4.2.2 A General Interacting System 


For interacting systems it holds quite generally that: 
An arbitrary quantity A, which can be written as a volume integral over a 
density A(x), 


A= [ @xAco) , (2.4.11) 
Vv 
Its average value depends on the volume as 
(A) = [@x(aco) wv. (2.4.12) 
Vv 


The mean square deviation is given by 
(A4)* = ((A- (A)) (4 (4))) 
de i. dx if dx! (( A(x) = (A(x))) (A@’) 2 (A(x'))) ) x VE. 
= (2.4.13) 


Both the integrals in (2.4.13) are to be taken over the volume V. The cor- 
relation function in the integral however vanishes for |x — x’| > 1, where | is 
the range of the interactions (the correlation length). The latter is finite and 
thus the mean square deviation is likewise only of the order of V and not, as 
one might perhaps naively expect, quadratic in V. The relative deviation of 
A is therefore given by 


AA 1 
ee 2.4.14 


2.4.3 External Parameters: Pressure 


Let the Hamiltonian of a system depend upon an external parameter a: 
H = H(a). This external parameter can for example be the volume V of 
the system. Using the volume in phase space, §2, we can derive an expression 
for the total differential of the entropy dS. Starting from the phase-space 
volume 


Q(E,a) = fu O(E- H(a)) , (2.4.15) 


we take its total differential 


2.4 Temperature and Pressure 43 


: OH 
d@(B,a) = far 5(E — H(a)) (az - <a) 
oH 
= 2(E,a) (aE = (S_)aa) , (2.4.16) 
or 
ae, oH 
dlog 2 = A (de a (5 )aa) (2.4.17) 
We now insert S(E,a) = klog 2(E,a) and (2.4.4), obtaining 
1 oH 
dS = = (aE = (5—)aa) , (2.4.18) 


From (2.4.18), we can read off the partial derivatives of the entropy in 
terms of FE and a:° 


Cj ad ee ass) 
OE), T Oa / T \ 0a 
Introduction of the pressure (special case: a = V): 

After the preceding considerations, we can turn to the derivation of pres- 
sure within the framework of statistical mechanics. We refer to Fig. 2.6 as 
a guide to this procedure. A movable piston at a distance L from the origin 
of the coordinate system permits variations in the volume V = LA, where 
A is the cross-sectional area of the piston. The influence of the walls of the 
container is represented by a wall potential. Let the spatial coordinate of the 
ith particle in the direction perpendicular to the piston be x;. Then the total 
wall potential is given by 


N 
Vera = 5) (a; —L). (2.4.20) 


i=1 


zzz 2D DIZ Fig. 2.6. The definition of pressure 


Here, v(x; — L) is equal to zero for x; < L and is very large for x; > L, so 
that penetration of the wall by the gas particles is prevented. We then obtain 
for the force on the molecules 


8 The symbol (33), denotes the partial derivative of S with respect to the energy 
FE, holding a constant, etc. 
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Ov 6) OH 
PoDRoD (se) tape ae (2.4.21) 


The pressure is defined as the average force per unit area which the molecules 
exert upon the wall, from which we find using (2.4.21) that 


(F) OH 
= = 2.4.22 
‘ A OV ( ) 
In this case, the general relations (2.4.18) and (2.4.19) become 
1 
dS = pide + PdV) (2.4.23) 
and 
1 Os P Os 
cag (pence — = { — A 2.4.24 
7-(e),. 7-(¥), (234 


Solving (2.4.23) for dE, we obtain 
dE =TdS — Pdv , (2.4.25) 


a relation which we will later identify as the First Law of Thermodynamics 
[for a constant particle number; see Eqs. (3.1.3) and (3.1.3’)]. Comparison 
with phenomenological thermodynamics gives an additional justification for 
the identification of T with the temperature. As a result of 


—PdV = ay = (FdL) =6W , 


the last term in (2.4.25) denotes the work dW which is performed on the 
system causing the change in volume. 

We are now interested in the pressure distribution in two subsystems, 
which are separated from each other by a movable partition, keeping the 
particle numbers in each subsystem constant (Fig. 2.6’). The energies and 
volumes are additive 


E=F,+E,, V=aVit+v. (2.4.26) 


The probability that subsystem 1 has the energy FE, and the volume V, is 
given by 

H, + Hp - E) 
21 2(E,V) 


6 
w (F,,Vi) = janars ( 6( Ay E,)0(q E Vi )O(qo E V2) 


_ 2 (Fi, Vi) 22( Ee, V2) 
= D1 9(E, V) : (2.4.27a) 
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Fig. 2.6’. Two systems which are isolated from 
the external environment, separated by a movable 
wall which permits the exchange of energy. 


In (2.4.27a), the function O(q € Vi) means that all the spatial coordinates 
of the sub-phase space 1 are limited to the volume Vj and correspondingly, 
O(q2 € V2). Here, both E; and Vj are statistical variables, while in (2.4.2b), 
V, was a fixed parameter. Therefore, the normalization factor is given here 
by 


21 2(E,V) = pas ju 21(E1, Vi) Qo(E - F1,V —Vi). (2.4.27b) 


In analogy to (2.4.3), the most probable state of the two systems is found by 
the condition of vanishing derivatives of (2.4.27a) 


Ow (F1,Vi) a Ow (£1, Vi) 
Ee 0 and py 
From this, it follows that 


=0. 


0 0 
dE, 08 2(F1,Vi) = dE, °8 2(E2, V2) > T, = Te 


and (2.4.28) 


na log 2Q(F,, Vi) = ami 922( E>, V2) >P,=P. 
In systems which are separated by a movable wall and can exchange energy, 
the equilibrium temperatures and pressures are equal. 

The microcanonical density matrix evidently depends on the energy E 
and on the volume V, as well as on the particle number N. If we regard 
these parameters likewise as variables, then the overall variation of S must 
be replaced by 


1 P Lb 
dS = pie | pw pan . (2.4.29) 
Here, we have defined the chemical potential u by 
Hyd 
T= kay log Q(E,V,N). (2.4.30) 


The chemical potential is related to the fractional change in the number 
of accessible states with respect to the change in the number of particles. 
Physically, its meaning is the change in energy per particle added to the 
system, as can be seen from (2.4.29) by solving that expression for dE. 
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2.5 Thermodynamic Properties of Some 
Non-interacting Systems 


Now that we have introduced the thermodynamic concepts of temperature 
and pressure, we are in a position to discuss further the examples of a classical 
ideal gas, quantum-mechanical oscillators, and non-interacting spins treated 
in Sect. 2.2.2. In the following, we will derive the thermodynamic conse- 
quences of the phase-space surface or number of states 2 (FE) which we cal- 
culated there for those examples. 


2.5.1 The Ideal Gas 


We first calculate the thermodynamic quantities introduced in the preceding 
sections for the case of an ideal gas. In (2.2.16), we found the phase-space 
volume in the limit of a large number of particles: 


3N 


Q(E) = fare(e- (a0) - (x) (Ser) . (2.2.16) 


If we insert (2.2.16) into (2.3.6), we obtain the entropy as a function of the 


energy and the volume: 
3 
V (4nrmE\? 5 


Eq. (2.5.1) is called the Sackur—Tetrode equation. It represents the starting 
point for the calculation of the temperature and the pressure. The temperature 
is, from (2.4.4), defined as the reciprocal of the partial energy derivative of 
the entropy, T~! = ($3), = kN3E~', from which the caloric equation of 
state of the ideal gas follows immediately: 


S(E,V) =kN log 


E= SNE ; (2.5.2) 


With (2.5.2), we can also find the entropy (2.5.1) as a function of T and V: 


V ( 2amkT 2 5 
x( 72 ) o]. (2.5.3) 


The pressure is obtained from (2.4.24) by taking the volume derivative of 
(2.5.1) 


as kTN 
p=7(5) =". (2.5.4) 


S(T,V) =kN log 
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This is the thermal equation of state of the ideal gas, which is often written 
in the form 


PV =N&T. (2.5.4’) 


The implications of the thermal equation of state are summarized in the 
diagrams of Fig. 2.7: Fig. 2.7a shows the PVT surface or surface of the 
equation of state, i.e. the pressure as a function of V and T’. Figs. 2.7b,c,d 
are projections onto the PV-, the TV- and the PT-planes. In these dia- 
grams, the isotherms (J = const), the isobars (P = const), and the iso- 
chores (V = const) are illustrated. These curves are also drawn in on the 
PVT surface (Fig. 2.7a). 


Remarks: 


(i) It can be seen from (2.5.2) that the temperature increases with the energy 
content of the ideal gas, in accord with the usual concept of temperature. 
(ii) The equation of state (2.5.4) also provides us with the possibility of 
measuring the temperature. The determination of the temperature of 
an ideal gas can be achieved by measuring its volume and its pressure. 


surface of P 
equation of state 


\ \ isotherms 
\ \_ Tsconst; pV=const 
R. “Si. 
Vv 
b) 
Vv 
T P. : 
isochores 


isobars Veconst 


/  Pe=const 


Fig. 2.7. The equation of state of the ideal gas: (a) surface of the equation of 
state, (b) P-V diagram, (c) T-V diagram, (d) P-T diagram 
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The temperature of any given body can be determined by bringing it 
into thermal contact with an ideal gas and making use of the fact that 
the two temperatures will equalize [Eq. (2.4.5)]. The relative sizes of the 
two systems (body and thermometer) must of course be chosen so that 
contact with the ideal gas changes the temperature of the body being 
investigated by only a negligible amount. 


*2.5.2 Non-interacting Quantum Mechanical Harmonic 
Oscillators and Spins 


2.5.2.1 Harmonic Oscillators 


From (2.2.31) and (2.3.6), it follows for the entropy of non-coupled harmonic oscil- 
lators with e = E/N, that 


(2.5.5) 


e+ thw e+ thw e— $hw e— thw 
og log F 
hw hw hw 


where a logarithmic term has been neglected. From Eq. (2.4.4), we obtain for the 
temperature 


OS\~* hw é+thw\ 

T= a2 l 2 ; 2.5.6 
(5) k (108 5) ( ) 
inn . E+4Nhw fw ; 

From this, it follows via -—}5,5 = e*T that the energy as a function of the 
2 


temperature is given by 


1 1 
B= Nhe | so 5} ; (2.5.7) 
The energy increases monotonically with the temperature (Fig. 2.8). Limiting cases: 
For E + NS (the minimal energy), we find 

1 


Cg 0 (2.5.8a) 


and for EF — oo 


1 
CP (2.5.8b) 
log 1 
We can also see that for T — 0, the heat capacity tends to zero: Cy = ($4), — 0; 
this is in agreement with the Third Law of Thermodynamics. 


2.5.2.2 A Paramagnetic Spin-$ System 


Finally, we consider a system of N magnetic moments with spin 4 which do not 
interact with each other; or, more generally, a system of non-interacting two-level 
systems. We refer here to Sect. 2.2.3.2. From (2.2.35), the entropy of such a system 
is given by 
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15 4 
1.0 
Z 
= 
wi 
05 
Fig. 2.8. Non-coupled harmonic oscilla- 
°9 05 1.0 15 tors: the energy as a function of the tem- 
kT/ (hw) perature. 
S(£) = at (14 ¢) log + (1 c)log +5 ©} (2.5.9) 


with e = E/Nh. From this, we find for the temperature: 


OS \ ">. 2h 1—e\* 
T= = ] : 2.5.1 
(SB) k (10 =) ant) 
The entropy is shown as a function of the energy in Fig. 2.9, and the temperature 


as a function of the energy in Fig. 2.10. The ground-state energy is Eo = —Nh. For 
E — —Nh, we find from (2.5.10) 


lim T=0. (2.5.11) 
E>—Nh 
The temperature increases with increasing energy beginning at Kyo = —Nh mono- 


tonically until FE = 0 is reached; this is the state in which the magnetic moments are 
completely disordered, i.e. there are just as many oriented parallel as antiparallel 
to the applied magnetic field h. The region E > 0, in which the temperature is 
negative (!), will be discussed later in Sect. 6.7.2. 


entropy S/Nk 
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€ 
Lt 
2 
z 
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b 


Fig. 2.9. The entropy as a function Fig. 2.10. The temperature as a func- 
of the energy for a two-level system tion of the energy for a two-level system 
(spin— 5—paramagnet) (spin— 4 —paramagnet) 
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2.6 The Canonical Ensemble 


In this section, the properties of a small subsystem 1 which is embedded in a 
large system 2, the heat bath,® will be investigated (Fig. 2.11). We first need 
to construct the density matrix, which we will derive from quantum mechanics 
in the following section. The overall system is taken to be isolated, so that it 
is described by a microcanonical ensemble. 


Fig. 2.11. A canonical ensemble. Subsystem 
1 is in contact with the heat bath 2. The over- 
all system is isolated. 


2.6.1 The Density Matrix 
The Hamiltonian of the total system 


is the sum of the Hamiltonians H, and Hy» for systems 1 and 2 and the 
interaction term W. The latter is in fact necessary so that the two subsystems 
can come to equilibrium with each other; however, W is negligibly small 
compared to H; and Ho. Our goal is the derivation of the density matrix for 
subsystem 1 alone. We will give two derivations here, of which the second is 
shorter, but the first is more useful for the introduction of the grand canonical 
ensemble in the next section. 


(i) Let Pp,,, be the probability that subsystem 1 is in state n with an energy 
eigenvalue F,,,. Then for Pz,,, using the microcanonical distribution for the 
total system, we find 


/ 1 
Pr, = = 2.6.2 
Ss 21 2(E)A 21,2(E) ( ) 
The sum runs over all the states of subsystem 2 whose energy F2,, lies in the 
interval F — Ey, < Eo, < E+ A— Ej,. In the case that subsystem 1 is very 
much smaller than subsystem 2, we can expand the logarithm of 22(E— E\,,) 
in Ein: 


° A heat bath (or thermal reservoir) is a system which is so large that adding or 
subtracting a finite amount of energy to it does not change its temperature. 
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— (EB — Ey + E, — Ein) 


Pax, 
, Ma(F) (2.6.3) 
= 22(E — E;) e(B1—Bin)/AT 71g Ean [AT 
21 2(E) 


~ \-1 
This expression contains T = (kay log 22(E — By) , the temperature of 
the heat bath. The normalization factor Z, from (2.6.3), is given by 
= 2(F) are (2.6.4) 
~ 22( Ed) 


However, it is important that Z can be calculated directly from the properties 
of subsystem 1. The condition that the sum over all the Pz,,, must be equal 
to 1 implies that 


Ga Serr nh ase Ba (2.6.5) 


Z is termed the partition function. The canonical density matrix is then given 
by the following equivalent representations 


po= d, Pen |n) (n| = eS maa |n) (n| = Z-te“#2/FF | (2.6.6) 


(ii) The second derivation starts with the fact that the density matrix p 
for subsystem 1 can be obtained form the microcanonical density matrix by 
taking the trace over the degrees of freedom of system 2: 


(Hi +H,-E)  2(E-Hy) 


Po = Tre puc = Tre 


Q12(E ~ Qyo(EB 
oe 1.2(F) Mal) (2.6.7) 
— M(B ~ Ei + Bi ~ fh) (8 ~ Fh) e,-ayer 
212(E) 21,2(E) , 


This derivation is valid both in classical physics and in quantum mechanics, as 
is shown specifically in (2.6.9). Thus we have also demonstrated the validity 
of (2.6.6) with the definition (2.6.5) by this second route. 


Expectation values of observables A which act only on the states of sub- 
system 1 are given by 


(A) =TriTrapwcA=TripcdA. (2.6.8) 


Remarks: 


(i) The classical distribution function: 
The classical distribution function of subsystem 1 is obtained by integration 
of Puc over In 
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po(m,P1) = f ars prc 


1 
= ju Rstey — Ai(q1,p1) — H2(q2,p2)) (2.6.9) 
_ 22(E -— Hi(u,p1)) 
921,2(E) ; 
If we expand the logarithm of this expression with respect to H,, we obtain 
polaim)=Z te m/e (2.6.10a) 
Z= pen ei (qipi)/kT (2.6.10b) 


Here, Z is called the partition function. Mean values of observables A(q,p1) 
which refer only to subsystem 1 are calculated in the classical case by means 
of 


(A) = pan pc(M, Pi)A(H, P1) 5 (2.6.10c) 


as one finds analogously to (2.6.8). 

(ii) The energy distribution: 

The energy distribution w (£1) introduced in Sect. 2.4.1 can also be calculated clas- 
sically and quantum mechanically within the framework of the canonical ensemble 
(see problem 2.7): 


1 E,+Ay, 
w (Ei) = 5 / dE S~5(E} — Bin) Per, 
By to (2.6.11) 
no 922(E — E}) 1 a = 22(E — E,)21(E1) 
aby a 2(E) 


n 


This expression agrees with (2.4.2a). 
(iii) The partition function (2.6.5) can also be written as follows: 


Z= fae Trie /"7 §(H, — FE) = aes Trie @1/*F §(Hy — F:1) 
(2.6.12) 
= par @ Pi/kT 2Q1(E1) . 


(iv) In the derivation of the canonical density matrix, Eq. (2.6.7), we ex- 
panded the logarithm of 22(£ — H).We show that it was justified to termi- 
nate this expansion after the first term of the Taylor series: 

2o(E — Hy) = 22(E — BE, — (Hi — £,)) 


1 me), 1 (a1/T eyo 
Ay Ey)+4( 7) Fy)? +... 


- te (A —B1)— seo ja, Ey)?+... 


= Q2(E — E,)e or (Ai Ey)(1t+s¢6(1 Ey)+...) 


2 
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where C is the heat capacity of the thermal bath. Since, owing to the large size 
of the thermal bath, (H; — E,) < TC holds (to be regarded as an inequality 
for the eigenvalues), it is in fact justified to ignore the higher-order corrections 
in the Taylor expansion. 

(v) In later sections, we will be interested only in the (canonical) subsys- 
tem 1. The heat bath 2 enters merely through its temperature. We shall then 
leave off the index ‘1’ from the relations derived in this section. 


2.6.2 Examples: the Maxwell Distribution and the Barometric 
Pressure Formula 


Suppose the subsystem to consist of one particle. The probability that its 
position and its momentum take on the values x and p is given by: 


2 
w(x, p) Bx dp = Ce-8( Fe tV 0) Pr Bp (2.6.13) 


Here, @ = jp and V(x) refers to the potential energy, while C = C'C” 
is a normalization factor!®. Integration over spatial coordinates gives the 
momentum distribution 


2 
w(p) dp = Cle 82m dp. (2.6.14) 
If we do not require the direction of the momentum, i.e. integrating over all 
angles, we obtain 


2 
w(p) dp = ArC'e 82m p? dp : (2.6.15) 


this is the Maxwell velocity distribution. Integration of (2.6.13) over the mo- 
mentum gives the spatial distribution: 


w(x) Ba = Ce PVS) Be , (2.6.16) 


If we now set the potential V(x) equal to the gravitational field V(x) = mgz 
and use the fact that the particle-number density is proportional to w(x), 
we obtain [employing the equation of state for the ideal gas, (2.5.4’), which 
relates the pressure to the particle-number density] an expression for the 
altitude dependence of the pressure, the barometric pressure formula: 


P(z) = Pye ™a7/kF (2.6.17) 
(cf. also problem 2.15). 


10 O = (ee and C” = @ Bae PV)” 


2a 
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2.6.3 The Entropy of the Canonical Ensemble 
and Its Extremal Values 


From Eq. (2.6.6), we find for the entropy of the canonical ensemble 


Tes 
So = —ki(log pc) = pt + klog Z (2.6.18) 
with 
B=(H). (2.6.18") 


Now let p correspond to a different distribution with the same average energy 
(H) = E; then the inequality 


S[p] = —k Tr (plog p) < —k Tr (p log pc) 


H 1 (2.6.19) 
=k Tr(p (= — log Z)) = Z(H) + klog Z = 
t (p (Ga — log Z)) = F(A) + blog Z = Sc 
results. Here, the inequality in (2.3.3) was used along with p; = pc. The 
canonical ensemble has the greatest entropy of all ensembles with the same 
average energy. 


2.6.4 The Virial Theorem and the Equipartition Theorem 


2.6.4.1 The Classical Virial Theorem 
and the Equipartition Theorem 


Now, we consider a classical system and combine its momenta and spatial 
coordinates into x; = p;,q:. For the average value of the quantity «32 we 
of 


find the following relation: 


(xo) => 2 f arn ee elit 
Ox; Ox; 


de—H/kT 
= 2 f are (-ke) = kT 6;;, (2.6.20) 
Ox; 

where we have carried out an integration by parts. We have assumed that 
exp(—H(p,q)/kT) drops off rapidly enough for large p and q so that no 
boundary terms occur. This is the case for the kinetic energy and potentials 
such as those of harmonic oscillators. In the general case, one would have to 
take the wall potential into account. Eq. (2.6.20) contains the classical virial 
theorem as a special case, as well as the equipartition theorem. 

Applying (2.6.20) to the spatial coordinates g;, we obtain the classical 
virial theorem 


J 
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We now specialize to the case of harmonic oscillators, i.e. 


V= De Se. (2.6.22) 


For this case, it follows from (2.6.21) that 


vie 


; (2.6.23) 


The potential energy of each degree of freedom has the average value kT/2. 
Applying (2.6.20) to the momenta, we find the equipartition theorem. We 
take as the kinetic energy the generalized quadratic form 


Exin = S- AKnpipe, With aj, = any - (2.6.24) 
i,k 


For this form, we find OF bin =>, (GikPe+OniPr) = >, 2dinpe and therewith, 
after multiplication Be Di and summation over all 2, 


: k 


a 


Now we take the thermal average and find from (2.6.20) 
OH 
(Sag) = 2(Ekin) = 3 NkT ; (2.6.26) 


i.e. the equipartition theorem. The average kinetic energy per degree of free- 
dom is equal to $kT. 

As previously mentioned, in the potential V, the interaction 
4 im.n U(Xmnl) (with Xmn = Xm — Xn) of the particles with each other and 
in general their interaction with the wall, Vian, must be taken into account. 
Then using (2.6.23) and (2.6.25), we find 


PV = 2(Ehin) — 2 (mm nel (2.6.27) 


m,n 


The term PV results from the wall potential. The second term on the right- 
hand side is called the ‘virial’ and can be expanded in powers of x (virial 
expansion, see Sect. 5.3). 


*Proof of (2.6.27): 
We begin with the Hamiltonian 


H= as 5 = Xm) + Vwall ry (2.6.28) 
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— 4 Fig. 2.12. Quantities related to the wall po- 
tential and the pressure: increasing the volume 
L, éL, 


on displacing a wall by 6L, 


and write for the pressure, using (2.4.22): 


(St) tals = 73M 0, + aE, +E 


or 


Now, Vwau has the form (cf. Fig. 2.12) 


Vat = Voo ¥ {O(i1 — Li) + O(ai2 — Lo) + O(xig — La)} (2.6.30) 


Here, Vx. characterizes the barrier represented by the wall. The kinetic energy of 
the particles is much smaller than V... Evidently, a = —Voo >, 8(@n1 — £1) 
and therefore 


(S521 ee = (0 2m Vood (tna — Ih) = (0 1Va05 (ami — Li)) 


= - (i Ft) = (nn SE), 


With this, (2.6.29) can be put into the form 


PV = 3 3 on =~ Vw!) = KEN ; ( ~ vs 5”) (2.6.31) 
2 1 a 
= 5 (Ekin) - 5(L Xr mma) a =a, (2.6.32) 


In the first line, the virial theorem (2.6.21) was used, and we have abbreviated the 
sum of the pair potentials as v. In the second line, kT was substituted by (2.6.26) 
and the derivative of the pair potentials was written out explicitly, whereby for 
example 


0, O ( ina pace — X2) 
Ly Ae r ©2 Bas U\X1 ».@) “1 x2 A(x1 = 22) 


was used, and 21 (#2) refers to the « component of particle 1(2). With (2.6.32), we 
have proven (2.6.27). 
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*2.6.4.2 The Quantum-Statistical Virial Theorem 


Starting from the Hamiltonian 
3.2 
H= a Pn ae dV V (Xn — Xwail) a SS U(Xn — Xm) , (2.6.33) 


it follows that!! 


— SO xn + Vnv(x 0 —%m)) (2.6.34) 


Now, (| [H, >>), Xn - Pn] |v) = 0 for energy eigenstates. 
We assume the density matrix to be diagonal in the basis of the energy 
eigenstates; from this, it follows that 


2( Ein) Os Xn“ VnV (Xn — Xwall) ) 
_ (~ S> Xn Vnv(Xn — en) =0. (2.6.35) 


n mén 


With (2.6.31), we again obtain the virial theorem immediately 
hie) = 3Py = Hey ee ~ Xm) + Vo(Xn — Xm)) =0. (2.6.27) 


Eq. (2.6.27) is called the virial theorem of quantum statistics. It holds both 
classically and quantum mechanically, while (2.6.21) and (2.6.26) are valid 
only classically. 

From the virial theorem (2.6.27), we find for ideal gases: 


PV = = (Exim) = 


aS Mi (y2) = smn (v?) . (2.6.36) 


n 


For non-interacting classical particles, the mean squared velocity per particle, 
(v?), can be computed using the Maxwell velocity distribution; then from 
(2.6.36), one again obtains the well-known equation of state of the classical 
ideal gas. 


" See e.g. QM I, p. 218. 
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2.6.5 Thermodynamic Quantities in the Canonical Ensemble 


2.6.5.1 A Macroscopic System: The Equivalence of the Canonical 
and the Microcanonical Ensemble 


We assume that the smaller subsystem is also a macroscopic system. Then 
it follows from the preceding considerations on the width of the energy dis- 
tribution function w (E,) that the average value of the energy FE, is equal to 
the most probable value E\, i.e. 


Ey =f,. (2.6.37) 


We now wish to investigate how statements about thermodynamic quantities 
in the microcanonical and the canonical ensembles are related. To this end, 
we rewrite the partition function (2.6.4) in the following manner: 

Q)2(E) 


Z = 0 (B 2 (E E ne — wl Bry Ot Be 2 FF ; 
1 1 2 — fy 


(2.6.38) 
According to (2.4.8), the typical N\-dependence of w (£1) is given by 


w(E1) ~ Np teW 8 BaP Mae? (2.6.39) 


with the normalization factor determined by the condition [ dE, w (EF) = 1. 
From (2.4.14), the N-dependence takes the form of Eq. (2.6.39) even for 
interacting systems. We thus find from (2.6.38) that 


Z =e MT 2, (f,)/M, . (2.6.40) 


Inserting this result into Eq. (2.6.18), we obtain the following expression for 
the canonical entropy [using (2.6.37) and neglecting terms of the order of 
log Ni]: 
| em a au E 

Sco = plh a Ey + kT log Q(E1)) = Surc(E1) . (2.6.41) 
From (2.6.41) we can see that the entropy of the canonical ensemble is equal 
to that of a microcanonical ensemble with the energy E,(= £). In both 
ensembles, one obtains identical results for the thermodynamic quantities. 


2.6.5.2 Thermodynamic Quantities 


We summarize here how various thermodynamic quantities can be calculated 
for the canonical ensemble. Since the heat bath enters only through its tem- 
perature 7’, we leave off the index 1 which indicates subsystem 1. Then for 
the canonical density matrix, we have 


po =e 98 /Z (2.6.42) 
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with the partition function 
Sate ak’; (2.6.43) 
where we have used the definition 6 = on We also define the free energy 
F=-kTlogZ. (2.6.44) 
For the entropy, we obtain from (2.6.18) 
So= a(E + kT log Z) . (2.6.45) 


The average energy is given by 


= 0 () 
E=(H) =—-~logZ =kT?— logZ. 2.6.4 
(H) ag 08 ap Wes (2.6.46) 
The pressure takes the form: 
OH OlogZ 
P=—( —)=kT : 2.6.47 
(5) OV ( ) 


The derivation from Sect. 2.4.3, which gave — (gH) for the pressure, is of 


course still valid for the canonical ensemble. From Eq. (2.6.45), it follows 
that 


FHSE-TSo: (2.6.48) 


Since the canonical density matrix contains T and V as parameters, F' is 
likewise a function of these quantities. Taking the total differential of (2.6.44) 
by applying (2.6.43), we obtain 


aT py __ 1 OH —BH 
dF = —kdT log Tr e 8" ral ae BEE) 


1 = 
- —7lE +kT log Z)dT + (ev 


and, with (2.6.45)—(2.6.47), 

dF(T,V) = —ScdT — Pdv . (2.6.49) 
From Egs. (2.6.48) and (2.6.49) we find 

dE =TdSc — Pav . (2.6.50a) 


This relation corresponds to (2.4.25) in the microcanonical ensemble. In the 
limiting case of macroscopic systems, F = EF = EF and So = Sy. 
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The First Law of thermodynamics expresses the energy balance. The most 
general change in the energy of a system with a fixed number of particles is 
composed of the work 6W = —PdV performed on the system together with 
the quantity of heat 6Q transferred to it: 


dE =6Q+6W. (2.6.50b) 
Comparison with (2.6.50a) shows that the heat transferred is given by 
6Q =TdS (2.6.50c) 


(this is the Second Law for transitions between equilibrium states). 

The temperature and the volume occur in the canonical partition function 
and in the free energy as natural variables. The partition function is calcu- 
lated for a Hamiltonian with a fixed number of particles.!? As in the case of 
the microcanonical ensemble, however, one can here also treat the partition 
function or the free energy, in which the particle number is a parameter, as 
a function of N. Then the total change in F is given by 


F 
dF =—Scat -Pav+(2~) an, (2.6.51) 
ON TLV 


and it follows from (2.6.48) that 


7 F 
dE = TdSo — PdV + (sm) dN . (2.6.52) 
ON ) ny 


In the thermodynamic limit, (2.6.52) and (2.4.29) must agree, so that we find 


(sr) ny =p. (2.6.53) 


2.6.6 Additional Properties of the Entropy 
2.6.6.1 Additivity of the Entropy 


We now consider two subsystems in a common heat bath (Fig. 2.13). As- 
suming that each of these systems contains a large number of particles, the 
energy is additive. That is, the interaction energy, which acts only at the 
interfaces, is much smaller than the energy of each of the individual systems. 
We wish to show that the entropy is also additive. We begin this task with 
the two density matrices of the subsystems: 


e FA _ e PH 
Z; ’ p2= ca 


p= (2.6.54a,b) 


!2 Exceptions are photons and bosonic quasiparticles such as phonons and rotons 
in superfluid helium, for which the particle number is not fixed (Chap. 4). 
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Fig. 2.13. Two subsystems 1 and 2 in one 
heat bath 


The density matrix of the two subsystems together is 

P= P1 P25 (2.6.54c) 
where once again W < Hj, H2 was employed. From 

(log p) = (log pi) + (log p2) (2.6.55) 
it follows that the total entropy S is given by 

S=S,4+ 52, (2.6.56) 
the sum of the entropies of the subsystems. Eq. (2.6.56) expresses the fact 
that the entropy is additive. 


*2.6.6.2 The Statistical Meaning of Heat 


Here, we want to add a few supplementary remarks that concern the statis- 
tical and physical meaning of heat transfer to a system. We begin with the 
average energy 


RS=(H) "Tr pi (2.6.57a) 


for an arbitrary density matrix and its total variation with a fixed number 
of particles 


dE = Tr (dp H + pdH) , (2.6.57b) 


where dp is the variation of the density matrix and dH is the variation of the 
Hamiltonian (see the end of this section). The variation of the entropy 


S = —kTr plogp (2.6.58) 
is given by 
dS = —k Tr (dplog p + 5) (2.6.59) 


Now we have 


Tr do =0, (2.6.60) 
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since for all density matrices, Tr p = Tr(p + dp) = 1, from which it follows 
that 


dS = —kTr (log pdp) . (2.6.61) 


Let the initial density matrix be the canonical one; then making use of 
(2.6.60), we have 


dS = ah (H dp) . (2.6.62) 


If we insert this into (2.6.57b) and take the volume as the only parameter in 
H,ie. dH = ge dV, we again obtain (cf. (2.6.50a)) 


i oH 
B=Td a NA 6. 
d s+ (5) V (2.6.63) 


We shall now discuss the physical meaning of the general relation (2.6.57b): 

lst term: this represents a change in the density matrix, i.e. a change in the 

occupation probabilities. 

2nd term: the change of the Hamiltonian. This means a change in the energy 

as a result of influences which change the energy eigenvalues of the system. 
Let p be diagonal in the energy eigenstates; then 


E=S nik, , (2.6.64) 
and the variation of the average energy has the form 

dE = dp:E; + > pid E; . (2.6.65) 
Thus, the quantity of heat transferred is given by 


5Q = So dpi; . (2.6.66) 


A transfer of heat gives rise to a redistribution of the occupation probabilities 
of the states |2). Heating (heat input) increases the populations of the states 
at higher energies. Energy change by an input of work (work performed on 
the system) produces a change in the energy eigenvalues. In this process, the 
occupation numbers can change only in such a way as to keep the entropy 
constant. 

When only the external parameters are varied, work is performed on the 
system, but no heat is put into it. In this case, although do may exhibit 
a change, there is no change in the entropy. This can be shown explicitly 
as follows: From Eq. (2.6.61), we have dS = —kTr (log pdp). It then follows 
from the von Neumann Eq. (1.4.8), 6 = }[p, H(V(t))], which is valid also 
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for time-dependent Hamiltonians, e.g. one containing the volume V(t): 


S=—-kTr (log p p) 
J 4 (2.6.67) 
= —| Tr (logp [p, H]) = — 5 Tr (H [log p, p]) = 0. 


The entropy does not change, and no heat is put into the system. An example 
which demonstrates this situation is the adiabatic reversible expansion of 
an ideal gas (Sect. 3.5.4.1). There, as a result of the work performed, the 
volume of the gas changes and with it the Hamilton function; furthermore, 
the temperature of the gas changes. These effects together lead to a change 
in the distribution function (density matrix), but however not of the entropy. 


2.7 The Grand Canonical Ensemble 


2.7.1 Systems with Particle Exchange 


After considering systems in the preceding section which can exchange energy 
with a heat bath, we now wish to allow in addition the exchange of matter 
between subsystem 1 on the one hand and the heat bath 2 on the other; this 
will be a consistent generalization of the canonical ensemble (see Fig. 2.14). 
The overall system is isolated. The total energy, the total particle number 
and the overall volume are the sums of these quantities for the subsystems: 


E=F,+E), N=N,+N2., V=Vi4+Ve. (2.7.1) 


Fig. 2.14. Regarding the grand canonical 
ensemble: two subsystems 1 and 2, between 
which energy and particle exchange is permit- 
ted. 


The probability distribution of the state variables £,,.N1, and V, of sub- 
system 1 is found in complete analogy to Sect. 2.4.3, 


0 (E,, Ni, Vi) Q(B — F,,N -M,V —- Vi) 
2(E,N,V) 


w (£1, Ni, V1) = (2.7.2) 


The attempt to find the maximum of this distribution leads again to equality 
of the logarithmic derivatives, in this case with respect to FE, V and N. The 
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first two relations were already seen in Eq. (2.4.28) and imply temperature 
and pressure equalization between the two systems. The third formula can 
be expressed in terms of the chemical potential which was defined in (2.4.29): 


a) 
p= kT > log Q(E,N,V) = —T (Fr) (2.7.3) 


and we obtain finally as a condition for the maximum probability the equal- 
ization of temperature, pressure, and chemical potential: 


T, = Ta, P, = Py, fi = pe. (2.7.4) 


2.7.2 The Grand Canonical Density Matrix 


Next, we will derive the density matrix for the subsystem. The probability 
that in system 1 there are N; particles which are in the state |n) at the 
energy F1,,(.N1) is given by: 


1 


P(N1, Ein(™1), Vi) = Q(E,N,V)A 


E—Ein(N1)<Eam(N2)<E-Ein(N1)+A 


22(E = Ein, N > Ni, V2) 
Q(E, N,V) 


(2.7.5) 


In order to eliminate system 2, we carry out an expansion in the variables Fi, 
and N, with the condition that subsystem 1 is much smaller than subsys- 
tem 2, analogously to the case of the canonical ensemble: 


BONG Pin(Ni), ViiaZere= A Pea (2.7.6) 


We thus obtain the following expression for the density matrix of the grand 
canonical ensembles: 


pg = Zale Fea) /kE 7) 


where the grand partition function Zg (or Gibbs distribution) is found from 
the normalization of the density matrix to be 


Zqo=Tr Ca tee ee 


Se Sie RMP E TRNAEET = Yo ACN en ahe 3 (28) 
Ni Ni 


13 See also the derivation in second quantization, p. 69 
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The two trace operations Tr in Eq. (2.7.8) refer to different spaces. The trace 
after the second equals sign refers to a summation over all the diagonal matrix 
elements for a fixed particle number Nj, while the Tr after the first equals sign 
implies in addition the summation over all particle numbers N; = 0,1,2,.... 
The average value of an operator A in the grand canonical ensemble is 


(A) = Tr (pe) 


where the trace is here to be understood in the latter sense. 
In classical statistics, (2.7.7) remains unchanged for the distribution func- 


tion, while Tr — >> na f{dIn, must be replaced by the 6Nj-dimensional 
operator dI'y, = oe 


From (2.7.5), Zg* can also be given in terms of 


§22(E, N,V —Vi) _ .-pvi/er 


Zoi= 
7 Q(B, N,V) 


(2.7.9) 


for Vi < V; recall Eqns. (2.4.24) and (2.4.25). 
From the density matrix, we find the entropy of the grand canonical en- 
semble, 


i os = 
Sg = —k(log pg) = qe —uUN)+klog Ze. (2.7.10) 


Since the energy and particle reservoir, subsystem 2, enters only via its tem- 
perature and chemical potential, we dispense with the index 1 here and in 
the following sections. 

The distribution function for the energy and the particle number is ex- 
tremely narrow for macroscopic subsystems. The relative fluctuations are 
proportional to the square root of the average number of particles. There- 
fore, we have E = E and N = N for macroscopic subsystems. The grand 
canonical entropy, also, may be shown (cf. Sect. 2.6.5.1) in the limit of macro- 
scopic subsystems to be identical with the microcanonical entropy, taken at 
the most probable values (with fixed volume V)) 


kj=E,, M=N (2.7.11) 


Se = Suc(Ei,M) - (2.7.12) 


2.7.3 Thermodynamic Quantities 


In analogy to the free energy of the canonical ensemble, the grand potential 
is defined by 


@ = —kT log Zc, (2.7.13) 
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from which with (2.7.10) we obtain the expression 
@(T,u,V)=E-TSg—wuN. (2.7.14) 


The total differential of the grand potential is given by 


O® O® Of 
d® = | —— dT + a) dV + (=) du . 2.7.15 
Gas (or Ty Ou VT a ( 


The partial derivatives follow from (2.7.13) and (2.7.8): 


kT? 


Of OH Of 1 = 
(57) = Br) =P > adn, =P =F 


(2.7.16) 


Of 1 1 = — 
(ar), = —klog Zg — kT —5(H —- uN) = pie —- E+ uN) =—-S¢ 


If we insert (2.7.16) into (2.7.15), we find 


d® = —SgdT — PdV — Ndp - (2-747) 
From this, together with (2.7.14), it follows that 
dE = TdSq — PdV + pd ; (2.7.18) 


this is again the First Law. As shown above, for macroscopic systems we can 
use simply £, N and S in (2.7.17) and (2.7.18) instead of the average values of 
the energy and the particle number and Sg; we shall do this in later chapters. 
For a constant particle number, (2.7.18) becomes identical with (2.4.25). The 
physical meaning of the First Law will be discussed in detail in Sect. 3.1. 
We have considered the fluctuations of physical quantities thus far only in 
Sect. 2.4.2. Of course, we could also calculate the autocorrelation function 
for energy and particle number in the grand canonical ensemble. This shows 
that these quantities are extensive and their relative fluctuations decrease 
inversely as the square root of the size of the system. We shall postpone 
these considerations to the chapter on thermodynamics, since there we can 
relate the correlations to thermodynamic derivatives. 

We close this section with a tabular summary of the ensembles treated in 
this chapter. 
Remark concerning Table 2.1: The thermodynamic functions which are found 
from the logarithm of the normalization factors are the entropy and the 
thermodynamic potentials F’ and ® (see Chap. 3). The generalization to 
several different types of particles will be carried out in Chap. 5. To this end, 
one must merely replace N by {N;} and pu by {p;}. 
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Table 2.1. The most important ensembles 


Ensemble microcanonical canonical grand canonical 


Physical — situa- enetey. and 


hon isolated energy exchange particle 
exchange 
EVM * 1 H/kT Z Ga Vin) * 
‘ : 2(B,V,N ae WV; 
Density matrix StH E) ZEVNe oUt aN) /kr 
tok te) 2 (E,V,N) = Z(T,V,N) = Za(T, V, w) = 
Normalization Tr 6(H — EB) Tr eo HIRT Tr oe HO EN)/AT 
Independent E,V,N T,V,N TV, 
variables 
Thermodynamic 5 F e 
functions 


2.7.4 The Grand Partition Function 
for the Classical Ideal Gas 


As an example, we consider the special case of the classical ideal gas. 


2.7.4.1 Partition Function 
For the partition function for N particles, we obtain 


1 2 
y= sagan f das -dasy f dps. -dpa WP E9812" 
(2.7.19) 


VN famm\"® 1 (V\% 
~ N! \ Bh? ~ NI \ 8 
with the thermal wavelength 
N= h/V20mkT . (2.7.20) 
Its name results from the fact that a particle of mass m and momentum h/ 
will have a kinetic energy of the order of kT. 
2.7.4.2 The Grand Partition Function 


Inserting (2.7.19) into the grand partition function (2.7.8), we find 


Co [oe) 1 


VEN” 2 eccpics 
Ze= yin Ss Ta (5) =r, (2.7.21) 
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where the fugacity 
z= ef (2.7.22) 


has been defined. 


2.7.4.3 Thermodynamic Quantities 
From (2.7.13) and (2.7.21), the grand potential takes on the simple form 
@ = —kT log Za = —kT2V/d* . (2723) 


From the partial derivatives, we can compute the thermodynamic relations. 


Particle number 


N=- ($=) = eV ie (2.7.24) 
Ou TV 
Pressure 
Of 
PV =-V (Fr) = —& = NkT (2.7.25) 
WV) ry, 


This is again the thermal equation of state of the ideal gas, as found 
in Sect. 2.5. For the chemical potential, we find from (2.7.22), (2.7.24), 
and (2.7.23) 


V/N kT kT 
= KT og ( 8 ) = —kT log P= kT log P — kT log sy . (2.7.26) 


For the entropy, we find 


Of 5, OV [i 4 
s=-(F) = ah 53 Kr ( a?) eB 


é VIN (2.7.27) 
=kN (5 +108 8 ) ; 
and for the internal energy, from (2.7.14), we obtain 
E=6+T7S+ uN = NkT(-1+ >) 2 S NRT ; (2.7.28) 


4 For the reasons mentioned at the end of the preceding section, we replace E 
and N in (2.7.16) and (2.7.17) by E and N. 
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*2.7.5 The Grand Canonical Density Matrix 
in Second Quantization 


The derivation of pg can be carried out most concisely in the formalism 
of the second quantization. In addition to the Hamiltonian H, expressed in 
terms of the field operators (x) (see Eq. (1.5.6d) in QM II'®), we require 
the particle-number operator, Eq. (1.5.10)!° 


N= | Ba wl (x)p(x) . (2.7.29) 
V 
The microcanonical density matrix for fixed volume V is 
1 


6(H — E)6(N —N). (2.7.30) 


oe 1(B,N,V) 
Corresponding to the division of the overall volume into two subvolumes, V = 
Vi +V2, we have H = H,+H> and N = N,+Ny with N; = ty Bx wt (x)v(x), 
i = 1,2. We find from (2.7.30) the probability that the energy and the particle 
number in subvolume 1 assume the values EF; and N;: 


w(E1,Vi, Ni) 
1 ‘ i 
1 R 
= Th ON, Vie (E — E1))6(N2 — (N — 1)) 


x6(Ay = E,)5(Ny —_ Ni) 


_ (Fi, Ni, Vi)Ro(F- h,N-M,V—-MN) (2.7.31) 

a Q(E, N,V) _ 
The (grand canonical) density matrix for subsystem 1 is found by taking 
the trace of the density matrix of the overall system over subsystem 2, with 


respect to both the energy and the particle number: 


1 “ 
ab 6(H — F)6(N-—N 
pa = Doar yl ~ BW — N) 
P (2.7.32) 
_ (FE -— M™,N—Ni,V —V;i) 
7 2(E,N,V) 
Expansion of the logarithm of pg in terms of H, and N, leads to 
— gale (H — uM) /kT 
pe ae (2.7.33) 


J= tee Ca pNy)/kT 


consistent with Equations (2.7.7) and (2.7.8), which were obtained by con- 
sidering the probabilities. 


15 FB. Schwabl, Advanced Quantum Mechanics (QM ID), 3'¢ ed., Springer Berlin, 
Heidelberg, New York 2005. This text will be cited in the rest of this book 
as QM II. 
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Problems for Chapter 2 


2.1 Calculate 2(£) for a spin system which is described by the Hamiltonian 


N 
H Syen Yee; 


i=l 
where S; can take on the values S; = +1/2 
QEA= Yi. 
E<E,<E+A 


Use a combinatorial method, rather than 2.2.3.2. 


2.2 For a one-dimensional classical ideal gas, calculate (pi) and (pt). 


Tv 


Formula: if sin™ x cos” «dx = 


0 


2.3 A particle is moving in one dimension; the distance between the walls of the 
container is changed by a piston at L. Compute the change in the phase-space 
volume 92 = 2Lp (p = momentum). 

(a) For a slow, continuous motion of the piston. 

(b) For a rapid motion of the piston between two reflections of the particle. 


2.4 Assume that the entropy S depends on the volume 2(F) inside the energy 
shell: S = f((2). Show that from the additivity of S and the multiplicative character 
of (2, it follows that S = const x log 92. 


2.5 (a) For a classical ideal gas which is enclosed within a volume V, calculate 
the free energy and the entropy, starting with the canonical ensemble. 
(b) Compare them with the results of Sect. 2.2. 


2.6 Using the assertion that the entropy S = —k Tr (plog p) is maximal, show that 
with the conditions Tr p = 1 and Tr pH = E for p, the canonical density matrix 
results. 

Hint: This is a variational problem with constraints, which can be solved using the 
method of Lagrange multipliers. 


2.7 Show that for the energy distribution in the classical canonical ensemble 


w(E1) = pu px 6(H, — E1) 


a Tha) (2.7.34) 


2.8 Consider a system of N classical, non-coupled one-dimensional harmonic os- 
cillators and calculate for this system the entropy and the temperature, starting 
from the microcanonical ensemble. 
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2.9 Consider again the harmonic oscillators from problem 2.8 and calculate for this 
system the average value of the energy and the entropy, starting with the canonical 
ensemble. 


2.10 In analogy to the preceding problems, consider N quantum-mechanical non- 
coupled one-dimensional harmonic oscillators and compute the average value of the 
energy F and the entropy, beginning with the canonical ensemble. Also investigate 
limp;_so E, limp_.o S and limr_.o S, and compare the limiting values you obtain with 
the results of problem 2.9. 


2.11 For the Maxwell distribution, find 
(a) the average value of the nth power of the velocity (v"), (b) (v), (c) ((v— (v))?), 


(d) (ey? ((v? — (v?))?), and (e) the most probable value of the velocity. 


2.12 Determine the number of collisions of a molecule of an ideal gas with the wall 
of its container per unit area and unit time, when 

(a) the angle between the normal to the wall and the direction of the velocity lies 
between O and 0 + do; 

(b) the magnitude of the velocity lies between v and v + dv. 


2.13 Calculate the pressure of a Maxwellian gas with the velocity distribution 


f(v)=n (F2)he : 


Suggestions: the pressure is produced by reflections of the particles from the walls 
of the container; it is therefore the average force on an area A of wall which acts 
over a time interval 7. 


1 i 
P= — | dtF,(t). 
=| () 
0 


If a particle is reflected from the wall with the velocity v, its contribution is given 


os 
from Newton’s 2nd axiom in terms of f dt Fr(t) by the momentum transferred per 
0 


collision, 2mv,z. Then P = + > 2mvz,whereby the sum extends over all particles 
which reach the area A within the time rT. 
Result: P = nkT . 


2.14 A simple model for thermalization: Calculate the average kinetic energy of a 
particle of mass m, with the velocity v1 due to contact with an ideal gas consisting 
of particles of mass m2. As a simplification, assume that only elastic and linear 
collisions occur. The effect on the ideal gas can be neglected. It is helpful to use the 
abbreviations M = m1 + mz and m = m1 — m2. How many collisions are required 
until, for m1 4 mez, a temperature equal to the (1 — e~')-fold temperature of the 
ideal gas is attained? 
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2.15 Using the canonical ensemble, calculate the average value of the particle- 
number density 


N 


n(x) = Ste — xi) 


i=1 


for an ideal gas which is contained in an infinitely high cylinder of cross-sectional 
area A in the gravitational field of the Earth. The potential energy of a particle in 
the gravitational field is mgh. Also calculate 

(a) the internal energy of this system, 

(b) the pressure at the height (altitude) h, using the definition 


P= J bvc)meae F 


(c) the average distance (z) of an oxygen molecule and a helium atom from the 
surface of the Earth at a temperature of 0°C, and 

(d) the mean square deviation Az for the particles in 2.15c. 

At this point, we mention the three different derivations of the barometric pressure 
formula, each emphasizing different physical aspects, in R. Becker, Theory of Heat, 
2nd ed., Sec. 27, Springer, Berlin 1967. 


2.16 The potential energy of N non-interacting localized dipoles depends on their 
orientations relative to an applied magnetic field H: 


N 
1 —pH. S~ cos 0; : 
i=l 


Calculate the partition function and show that the magnetization along the z- 
direction takes the form 


N 
M. = (So 108 v3) = NwL(6uH.); L(x) = Langevin function . 
i=1 


Plot the Langevin function. 
How large is the magnetization at high temperatures? Show that at high tempera- 
tures, the Curie law for the magnetic susceptibility holds: 


; OM, 
X= iim, (SF) ~ const /T . 


2.17 Demonstrate the equipartition theorem and the virial theorem making use of 
the microcanonical distribution. 


2.18 In the extreme relativistic case, the Hamilton function for N particles in 
three-dimensional space is H = )7, |pi|c. Compute the expectation value of H with 
the aid of the virial theorem. 


2.19 Starting with the canonical ensemble of classical statistics, calculate the equa- 
tion of state and the internal energy of a gas composed of N indistinguishable 
particles with the kinetic energy ¢(p) = |p| -c. 
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2.20 Show that for an ideal gas, the probability of finding a subsystem in the 
grand canonical ensemble with N particles is given by the Poisson distribution: 


where N is the average value of N in the ideal gas. 
Suggestions: Start from py = e?(?+%“) Zy. Express ®, u, and Zy in terms of N. 


2.21 (a) Calculate the grand partition function for a mixture of two ideal gases 
(2 chemical potentials!). 
(b) Show that 


PV =(Ni+N2)kT and 
E= 5 (Ni + N2)kT 


are valid, where Ni, No and F are the average particle number and the average 
energy. 


2.22 (a) Express FE by taking an appropriate derivative of the grand partition 
function. _ 
(b) Express (AE)? in terms of a thermodynamic derivative of E. 


2.23 Calculate the density matrix in the xz-representation for a free particle within 
a three-dimensional cube of edge length L: 


p(x, x’) = cy e 8Fn (x\n) (n|a') 


n 


where c is a normalization constant. Assume that L is so large that one can go to 
the limit of a continuous momentum spectrum 


L3d°p Li ecoip 
Sf ep le) — le) = ae". 


2.24 Calculate the canonical density matrix for a one-dimensional harmonic oscil- 


lator H = —(h?/2m)(d? /d?x) + noe in the x-representation at low temperatures: 


p(x, x’) = cy e BEn (a|n) (n|a') ; 


where c is the normalization constant. 


h 
(2|n) = (ni/? 2” n!I ao) /2¢@7 (#/0)"/2 Hy (=) 5; £o=14/—. 
x0 wm 


The Hermite polynomials are defined in problem 2.27. 
Suggestion: Consider which state makes the largest contribution. 


2.25 Calculate the time average of q? for the example of problem 1.7, as well as 
its average value in the microcanonical ensemble. 
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2.26 Show that: 


fat. daa f@’.a-k) 
= Qn) Ka-n f agg" [ dO(sin@)*? f(q’,qkcos@) , (2.7.35) 
0 0 
where k € R? is a fixed vector and q = |q|,k = |k|, and Kg = 2-¢*1n-4/? x 
Gey 


2.27 Compute the matrix elements of the canonical density matrix for a one- 
dimensional harmonic oscillator in the coordinate representation, 


Pra! = (al ple’) = (ale |e’) . 


Hint: Use the completeness relation for the eigenfunctions of the harmonic oscilla- 
tor and use the fact that the Hermite polynomials have the integral representation 


Ay(§) = (—1)"e& (4) ‘ee = = if (—2in) ent dy . 


Alternatively, the first representation for H,(x) and the identity from the next 
example can be used. 
Result: 


1/2 
1 mw 
Les) Og = 


x exp —2 ((« +2’)? tanh 5h + (x — x’)? etgh shu) \ . (2.7.36) 


2.28 Prove the following identity: 


a a 1 A 
bu ae tA — e tIs4an® . 


Det(1 + 4AT/) 


e 


Here, IT and A ate two commuting symmetric matrices, e.g. 21 2 = ie ITix a ; 
i 


3. Thermodynamics 


3.1 Thermodynamic Potentials and the Laws 
of Equilibrium Thermodynamics 


3.1.1 Definitions 


Thermodynamics treats the macroscopic properties of macroscopic systems. 
The fact that macroscopic systems can be completely characterized by a small 
number of variables, such as their energy EF, volume V, and particle num- 
ber N, and that all other quantities, e.g. the entropy, are therefore functions 
of only these variables, has far-reaching consequences. 

In this section, we consider equilibrium states and transitions from one 
equilibrium state to another neighboring equilibrium state. In the preceding 
sections, we have already determined the change in the entropy due to changes 
in EF, V and N, whereby the system goes from one equilibrium state EF, V, N 
into a new equilibrium state # + dE, V+dV, N+dN. Building upon the 
differential entropy (2.4.29), we will investigate in the following the First 
Law and the significance of the quantities which occur in it. Beginning with 
the internal energy, we will then define the most important thermodynamic 
potentials and discuss their properties. 

We assume the system we are considering to consist of one single type of 
particles of particle number N. We start with its entropy, which is a function 
of E, V, and N. 

Entropy : S=S(E,V,N) 
In (2.4.29), we found the differential entropy to be 
1 


£ iz 
=—dE4 dN . zal 
dS = = ao oe (3.1.1) 


From this, we can read off the partial derivatives: 


os aoe 08 =a as aoe eae aly) 
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which naturally agree with the definitions from equilibrium statistics. We can 
now imagine the equation S = S(E,V,N) to have been solved for E and 
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thereby obtain the energy FE, which in thermodynamics is usually termed the 
internal energy, as a function of S, V, and N. 

Internal Energy : E = E(S,V,N) 

From (3.1.1), we obtain the differential relation 


dE =TdS — PdV + pdN . (3.1.3) 


We are now in a position to interpret the individual terms in (3.1.3), 
keeping in mind all the various possibilities for putting energy into a system. 
This can be done by performing work, by adding matter (i.e. by increasing 
the number of particles), and through contact with other bodies, whereby 
heat is put into the system. The total change in the energy is thus composed 
of the following contributions: 


dE = 6Q + bW + 5Ey (3.1.3') 
| 
heat input 
mechanical work 


energy increase through addition of matter . 
The second term in (3.1.3) is the work performed on the system, 
5W = —PdV , (3.1.4a) 


while the third term gives the change in the energy on increasing the particle 
number 


SEN =pdN . (3.1.4b) 


The chemical potential has the physical meaning of the energy increase on 
adding one particle to the system (at constant entropy and volume). The first 
term must therefore be the energy change due to heat input dQ, ie. 


6Q=TdS . (3.1.5) 


Relation (3.1.3), the law of conservation of energy in thermodynamics, is 
called the First Law of Thermodynamics. It expresses the change in energy 
on going from one equilibrium state to another, nearby state an infinitesimal 
distance away. Equation (3.1.5) is the Second Law for such transitions. We 
will formulate the Second Law in a more general way later. In this connection, 
we will also clarify the question of under what conditions these relations of 
equilibrium thermodynamics can be applied to real thermodynamic processes 
which proceed at finite rates, such as for example the operation of steam 
engines or of internal combustion engines. 
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Remark: 


It is important to keep the following in mind: dW and dQ do not represent 
changes of state variables. There are no state functions (functions of E, V 
and NV) identifiable with W und Q. An object cannot be characterized by 
its ‘heat or work content’, but instead by its internal energy. Heat (~ energy 
transfer into an object through contact with other bodies) and work are ways 
of transferring energy from one body to another. 

It is often expedient to consider other quantities — with the dimensions 
of energy — in addition to the internal energy itself. As the first of these, we 
define the free energy: 

Free Energy (Helmholtz Free Energy) : F = F(T,V,N) 
The free energy is defined by 


F=E-TS (= -kTlogZ(T,V.N)) ; (3.1.6) 


in parentheses, we have given its connection with the canonical partition 
function (Chap. 2). From (3.1.3), the differential free energy is found to be: 


dF = —SdT — PdV + pdN Gay) 


with the partial derivatives 


OF OF OF 
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We can see from (3.1.8) that the internal energy can be written in terms of F’ 
in the form 


7 OF ee OP 
p=P-1(5e) =-2 (srr),., (3.1.9) 


From (3.1.7), it can be seen that the free energy is that portion of the energy 
which can be set free as work in an isothermal process; here we assume that 
the particle number N remains constant. In an isothermal volume change, 
the change of the free energy is given by (dF)r,.n = —PdV = dW, while 
(dE)r,.n # 6A, since one would have to transfer heat into or out of the 
system in order to hold the temperature constant. 

Enthalpy : H = H(S,P,N) 

The enthalpy is defined as 


H=E+PV. (3.1.10) 
From (3.1.3), it follows that 
dH =TdS + VdP + dN G.141) 


and from this, its partial derivatives can be obtained: 
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oH OH OH 
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For isobaric processes, (dH) py = TdS = 6Q = dE + PdV, thus the change 
in the enthalpy is equal to the change in the internal energy plus the energy 
change in the device supplying constant pressure (see Fig. 3.1). The weight Fe 
including the piston of area A holds the pressure constant at P = Fg¢/A. The 
change in the enthalpy is the sum of the change in the internal energy and the 
change in the potential energy of the weight. For a process at constant pres- 
sure, the heat dQ supplied to the system equals the increase in the system’s 
enthalpy. 


Fe 
= ZA 
Fig. 3.1. The change in the enthalpy in isobaric 
processes; the weight Faq produces the constant 
pressure P = Fa@/A, where A is the area of the 
piston. 
Free Enthalpy (Gibbs’ Free Energy) : G = G(T, P,N) 


The Gibbs’ free energy is defined as 

G=E-TS+PV. (3.1.13) 
Its differential follows from (3.1.3): 

dG =—SdT+VdP+udN . (3.1.14) 


From Eq. (3.1.14), we can immediately read off 


OG OG OG 
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(ae af ae US Cae He Oe) 


The Grand Potential : & = D(T,V, 1) 
The grand potential is defined as 


@=E-TS-pN  (=—kTlogZc(T,V,p)) ; (3.1.16) 


in parentheses we give the connection to the grand partition function 
(Chap. 2). The differential expressions are 


d® = —SdT — PdV — Nd, (3.1.17) 


Of Of O@ 
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3.1.2 The Legendre Transformation 


The transition from E to the thermodynamic potentials defined in (3.1.6), 
(3.1.10), (3.1.13), and (3.1.16) was carried out by means of so-called Legendre 
transformations, whose general structure will now be considered. We begin 
with a function Y which depends on the variables x1, x2,..., 


Y = Y(a1,2o,...). (3.1.19) 
The partial derivatives of Y in terms of the x; are 
OY 
aj(@71,%2,...) = (5 ) . (3.1.20a) 
TiS {05,5Ai} 
Our goal is now to replace the independent variable x; by the partial deriva- 


tives (#) as independent variables, i.e. for example to change from the 


independent variable S to T. This has a definite practical application, since 
the temperature is directly and readily measurable, while the entropy is not. 
The total differential of Y is given by 


dY =a,dx, + aod%2+... (3.1.20b) 
From the rearrangement dY = d(a,21) — x1 da, + agdr2 + ..., it follows that 

d(Y — a,21) = —a1da; + agdrg4+... . (3.1.21) 
It is then expedient to introduce the function 

Y%=Y-az., (3.1.22) 
and to treat it as a function of the variables a1, x72,... (natural variables).' 


Thus, for example, the natural variables of the (Helmholtz) free energy 
are T, V, and N. The differential of Yi(a1,72,...) has the following form 
in terms of these independent variables: 


dY, = —2,da; + a9d%9 +... (3.1.21/a) 


and its partial derivatives are 


OY, _ OY, — / 
a: =-T1, er = QAQ,.-.. (3.1.21 b) 


In this manner, one can obtain 8 thermodynamic potentials corresponding to 
the three pairs of variables. Table 3.1 collects the most important of these, 
i.e. the ones already introduced above. 


' We make an additional remark here about the geometric significance of the 
Legendre transformation, referring to the case of a single variable: a curve can 
be represented either as a series of points Y = Y(a1), or through the family 
of its envelopes. In the latter representation, the intercepts of the tangential 
envelope lines on the ordinate as a function of their slopes ai are required. This 
geometric meaning of the Legendre transformation is the basis of the construction 
of G(T, P) from F(T,V) shown in Fig. 3.33. [If one simply eliminated x in 
Y = Y(qa1) in favor of ai, then one would indeed obtain Y as a function of a1, 
but it would no longer be possible to reconstruct Y (x1)]. 
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Table 3.1. Energy, entropy, and thermodynamic potentials 


Independent 


State function eaabies Differentials 
rae S,V,{Nj} dE =TdS— PdV +X pjdNj 
Jj 
ee B,V,{Nj} dS = pd + FaV — YBN 
Jj 
Free Ener ; 
ook ae T,V,{N;} dF SdT — PdV 4 2s HyGNs 
Enthalpy : = dN; 
H=E+PV 5, P, {N;} dH AGES GEST DO HAG 
Gibbs’ Free Energy ; dG — —SdT dP dN; 
G=E-TS+PV T, P, {Nj} G SdT + V +H j 
Grand Potential 
$=E-TS—YpjN; T,V, {5} d& = —SdT — PdV D Nyjdu; 
J Jj 


This table contains the generalization to systems with several components (see 
Sect. 3.9). Nj and jj are the particle number and the chemical potential of the j-th 
component. The previous formulas are found as a special case when the index 7 
and >), are omitted. 


F, H, G and @ are called thermodynamic potentials, since taking their 
derivatives with respect to the natural independent variables leads to the 
conjugate variables, analogously to the derivation of the components of force 
from the potential in mechanics. For the entropy, this notation is clearly less 
useful, since entropy does not have the dimensions of an energy. E, F, H, G 
and @ are related to each other through Legendre transformations. The nat- 
ural variables are also termed canonical variables. In a system consisting of 
only one chemical substance with a fixed number of particles, the state is 
completely characterized by specifying two quantities, e.g. T and V or V 
and P. All the other thermodynamic quantities can be calculated from the 
thermal and the caloric equations of state. If the state is characterized by T 
and V, then the pressure is given by the (thermal) equation of state 


P=P(T,V). 


(The explicit form for a particular substance is found from statistical me- 
chanics.) If we plot P against T and V in a three-dimensional graph, we 
obtain the surface of the equation of state (or PVT surface); see Fig. 2.7 and 
below in Sect. 3.8. 
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3.1.3 The Gibbs—Duhem Relation in Homogeneous Systems 


In this section, we will concentrate on the important case of homogeneous 
thermodynamic systems.” Consider a system of this kind with the energy E, 
the volume V, and the particle number N. Now we imagine a second system 
which is completely similar in its properties but is simply larger by a factor a. 
Its energy, volume, and particle number are then af, aV, and aN. Owing 
to the additivity of the entropy, it is given by 


S(aE,aV,aN) =aS(E,V,N). (3.1.23) 


As a result, the entropy S is a homogeneous function of first order in EF, V 
and N. Correspondingly, EF is a homogeneous function of first order in S, V 
and N. 

There are two types of state variables: 

E, V, N, S, F, H, G, and @are called extensive, since they are proportional 
to @ when the system is enlarged as described above. T, P, and wu are 
intensive, since they are independent of a; e.g. we find 


OS das 0 
= SSS SS OO 

OE Oak , 
and this independence follows in a similar manner from the definitions of the 
other intensive variables, also. We wish to investigate the consequences of the 


homogeneity of S [Eq. (3.1.23)]. To this end, we differentiate (3.1.23) with 
respect to @ and then set a = 1: 


( Os OS OS y) 


Tol 
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From this, we find using (3.1.2) that —S 4 


=S'. 


a=1 


Lp .P = + 
pi+ GV — EN =), that is 


E=TS—PV+uN. (3.1.24) 


This is the Gibbs—Duhem relation. Together with dE = TdS — PdV + udN, 
we derive from Eq. (3.1.24) 


SdT —VdP + Ndu=0, (3.1.24’) 


the differential Gibbs—Duhem relation. It states that in a homogeneous sys- 
tem, T, P and p cannot be varied independently, and it gives the relationship 
between the variations of these intensive quantities.? The following expres- 
sions can be derived from the Gibbs—Duhem relation: 


? Homogeneous systems have the same specific properties in all spatial regions; 
they may also consist of several types of particles. Examples of inhomogeneous 
systems are those in a position-dependent potential and systems consisting of 
several phases which are in equilibrium, although in this case the individual 
phases can still be homogeneous. 

3 The generalization to systems with several components is given in Sect. 3.9, 
Eq. (3.9.7). 
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G(T, P,N) = u(T, P) N (3.1.25) 
and 


OT, V, py) =—P(T, pV. (3.1.26) 


Justification: from the definition (3.1.13), it follows immediately using (3.1.24) that 


G = wN, and from (3.1.15) we find p = ($2) -p =p (24) 2 NG it follows 


that ;4 must be independent of N. We have thus demonstrated (3.1.25). Similarly, 


it follows from (3.1.16) that 6 = —PV, and due to —P = eae P must be 


independent of V. 
Further conclusions following from homogeneity (in the canonical ensemble with 
independent variables T, V, and N) can be obtained starting with 


P(T,V,N)=P(L,aV,aN) and p(T,V,N) = pu(T,aV,aN) (3.1.27a,b) 


again by taking derivatives with respect to a around the point a = 1: 


oP OP - On On = 
a (sr) ,,.¥=° me (sr), ,¥* (aw), =" 
(3.1.28a,b) 


These two relations merely state that for intensive quantities, a volume increase is 
equivalent to a decrease in the number of particles. 


3.2 Derivatives of Thermodynamic Quantities 


3.2.1 Definitions 


In this section, we will define the most important thermodynamic derivatives. 

In the following definitions, the particle number is always held constant. 
The heat capacity is defined as 

_ 6Q _ dS 


C= aT Tr . (3.2.1) 
It gives the quantity of heat which is required to raise the temperature of 
a body by 1K. We still have to specify which thermodynamic variables are 
held constant during this heat transfer. The most important cases are that 
the volume or the pressure is held constant. If the heat is transferred at 
constant volume, the heat capacity at constant volume is relevant: 


Os OE 
ey (ari, Gar are 
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In rearranging (05/0T)y,n, we have used Eq. (3.1.1). If the heat transfer 
takes place under constant pressure, then the heat capacity at constant pres- 
sure from (3.2.1) must be used: 


Os OH 


For the rearrangement of the definition, we employed (3.1.11). If we divide the 
heat capacity by the mass of the substance or body, we obtain the specific 
heat, in general denoted as c, or cy at constant volume or cp at constant 
pressure. The specific heat is measured in units of J kg—' K~!. The specific 
heat may also be referred to 1 g and quoted in the (non-SI) units cal g~' K7!. 
The molar heat capacity (heat capacity per mole) gives the heat capacity of 
one mole of the substance. It is obtained from the specific heat referred to 
1 g, multiplied by the molecular weight of the substance. 


Remark: We will later show in general using Eq. (3.2.24) that the specific heat at 
constant pressure is larger than that at constant volume. The physical origin of this 
difference can be readily seen by writing the First Law for constant N my the form 
. EB E E 
ae dE + PdV and setting dE = (Sry dT + ($7) ,dV = CydT 4 (35) ,4aV, 
at is 


OE 
6Q = CydT 4 [P (Sr), dv . 


In addition to the quantity of heat CydT necessary for warming at constant volume, 
when V is increased, more heat is consumed by the work against the pressure, PdV, 
and by the change in the internal energy, (OE /0V)r dV. For Cp = (32) it then 
follows from the last relation that 


Cp =Cy 4 (P+ (sr).) Gos 


Further important thermodynamic derivatives are the compressibility, the 
coefficient of thermal expansion, and the thermal pressure coefficient. The 
compressibility is defined in general by 


_ ld 
VdP- 
It is a measure of the relative volume decrease on increasing the pressure. 


For compression at a constant temperature, the isothermal compressibility, 
defined by 


1 /OV 
2, 
KT ( ) a (3.2.3a) 


is the relevant quantity. For (reversible) processes in which no heat is trans- 
ferred, i.e. when the entropy remains constant, the adiabatic (isentropic) com- 
pressibility 
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must be introduced. The coefficient of thermal expansion is defined as 
1 (OV 
=—(|— : 3.2.4 
=) ny a 
The definition of the thermal pressure coefficient is given by 
1 (OP 
=— |{ — . 3.2.5 
0-35 ie 


Quantities such as C’, «, and a are examples of so-called susceptibilities. They 
indicate how strongly an extensive quantity varies on changing (increasing) 
an intensive quantity. 


3.2.2 Integrability and the Maxwell Relations 


3.2.2.1 The Maxwell Relations 


The Maxwell relations are expressions relating the thermodynamic deriva- 
tives; they follow from the integrability conditions. From the total differential 
of the function Y = Y (#1, x2) 


dY = a,dx, + agdre , (3.2.6) 
OY OY 
ay = —- a= —_— 
: 0x4 xe , 3 Ox 21 


we find as a result of the commutatitivity of the order of the derivatives, 


ea = xt = ot = (322)... the following integrability condition: 


Oa, Oag 
(SE). - fee ea 


All together, there are 12 different Maxwell relations. The relations for fixed 
N are: 


© (-@B, © (8), G), em 
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bs (ine cere: 210 


3.2 Derivatives of Thermodynamic Quantities 85 


Here, we have labeled the Maxwell relations with the quantity from whose dif- 
ferential the relation is derived. There are also relations containing N and p; 
of these, we shall require the following in this book: 


OL - OP 
F: (sr), —_ a oe (3.2.11) 


Applying this relation to homogeneous systems, we find from (3.1.28a) and 
(3.1.28b): 
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*3.2.2.2 Integrability Conditions, Exact and Inexact Differentials 


It may be helpful at this point to show the connection between the integra- 
bility conditions and the results of vector analysis as they apply to classi- 
cal mechanics. We consider a vector field F(x), which is defined within the 
simply-connected region G (this field could for example be a force field). Then 
the following statements are equivalent: 


(1) F(x) = -VV(x) 


with V(x) = ey dx'F(x’), where xo is an arbitrary fixed point of origin 
and the line integral is to be taken along an arbitrary path from xg to x. 
This means that F(x) can be derived from a potential. 


(II) curlF =0 at each point in G. 
(III) ¢dxF(x)=0 along each closed path in G. 
¥ 30) oa ie 
(IV) fy, dx F(x) is independent of the path. 
Let us return to thermodynamics. We consider a system characterized 


by two independent thermodynamic variables 2 and y and a quantity whose 
differential variation is given by 


dY = A(a,y)dx + B(x, y)dy . (3.2.13) 


In the notation of mechanics, F = (A(z,y), B(x, y),0). The existence of a 
state variable Y, i.e. a state function Y(#,y) (Statement (I’)) is equivalent 
to each of the three other statements (II’,III’, and IV’). 


(I) A state function Y (a, y) exists, with 
Y(a, y) = Y (20, Yo) + ee (da! A(2’, y') + dy’ B(2’, y')). 
dB) _ (aA 
a) (82), = (34), 
(I) $(drA(a,y) + dyB(z,y)) =0 
(IV’) Be (dxA(x,y) + dyB(x,y)) is independent of the path. 
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Fig. 3.2. Illustrating the path integrals III’ 
and IV’ 


The differential (3.2.13) is called an exact differential (or a perfect differ- 
ential) when the coefficients A and B fulfill the integrability condition (II’). 


3.2.2.3 The Non-integrability of 6Q and dW 


We can now prove that 6Q and 6W are not integrable. We first consider (W 
and imagine the independent thermodynamic variables to be V and T. Then 
the relation (3.1.4a) becomes 


65W =—PdV +0-dT. (3.2.14) 


The derivative of the pressure with respect to the temperature at constant 
volume is nonzero, (4), # 0, while of course the derivative of zero with 
respect to V gives zero. That is, the integrability condition is not fulfilled. 
Analogously, we write (3.1.5) in the form 


5Q=TdS +0-dV. (3.2.15) 
as ap 
Again, we have (3) 5 = a = — # 0, ie. the integrability 
Vv Vv 


condition is not fulfilled. Therefore, there are no state functions W(V,T, N) 
and Q(V,T, N) whose differentials are equal to )W and 6Q. This is the rea- 
son for the different notation used in the differential signs. The expressions 
relating the heat transferred to the system and the work performed on it to 
the state variables exist only in differential form. One can, of course, com- 
pute the integral [, 6Q = J, TdS along a given path (e.g. 1 in Fig. 3.2), and 
similarly for dW, but the values of these integrals depend not only on their 
starting and end points, but also on the details of the path which connects 
those points. 


Remark: 


In the case that a differential does not fulfill the integrability condition, 


dY = A(z, y)dx + B(x, y)dy , 
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but can be converted into an exact differential through multiplication by a 
factor g(x,y), then g(a,y) is termed an integrating factor. Thus, > is an 
integrating factor for dQ. In statistical mechanics, it is found quite naturally 
that the entropy is a state function, i.e. dS is an exact differential. In the 
historical development of thermodynamics, it was a decisive and nontrivial 
discovery that multiplication of 6Q by z yields an exact differential. 


3.2.3 Jacobians 


It is often necessary to transform from one pair of thermodynamic variables 
to a different pair. For the necessary recalculation of the thermodynamic 
derivatives, it is expedient to use Jacobians. 

In the following, we consider functions of two variables: f(u, v) and g(u, v). 
We define the Jacobian determinant: 


se Pe ay" |~ (35), (B),- (30), (3), - 2219 


Ou ov 


This Jacobian fulfills a series of important relations. 
Let u = u(x, y) and v = u(x, y) be functions of x and y; then the following 
chain rule can be proved in an elementary fashion: 


Af,g) _ Af,g9) Wu, ») (3.2.17) 


A(a,y) — (u,v) A(x, y) © 


This relation is important for the changes of variables which are frequently 
needed in thermodynamics. Setting g = v, the definition (3.2.16) is simplified 
to 


— 7 (3). (3.2.18) 


Since a determinant changes its sign on interchanging two columns, we have 


Af.g) _ _ Af,g) (3.2.19) 


O(v, u) O(u,v) - 
If we apply the chain rule (3.2.17) for « = f and y = g, we find: 


Af, g) Ou,v) _ 
Huo) Bg) 7 (3.2.20) 


Setting g = v in (3.2.20), we obtain with (3.2.18) 


(34). = Gy. . (3.2.20’) 
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Finally, from (3.2.18) we have 


(54) _ Of,v) _ Af,v) AF,u) _ (3). meas 
Ou), OAu,v) OA(f,u) O(u,v) Cr . 2. 


Using this relation, one can thus transform a derivative at constant v into 
derivatives at constant u and f. The relations given here can also be applied 
to functions of more than two variables, provided the additional variables are 
held constant. 


3.2.4 Examples 


(i) We first derive some useful relations between the thermodynamic deriva- 
tives. Using Eqns. (3.2.21), (3.2.3a), and (3.2.4), we obtain 


oP (Fr) p _ @ 
(sz), ma aoe (3.2.22) 


Thus, the thermal pressure coefficient @ = 4 ($), [Eq. (3.2.5)] is related 
to the coefficient of thermal expansion a and the isothermal compressibility 
«. In problem 3.4, it is shown that 


C 
ee = ie (3.2.23) 
[cf. (3.2.3a,b)]. Furthermore, we see that 


_,,0(5,V) _ ,,A(S,V) A(T, P) _ 
Cv =P 5TV) =P (T.P) OIT.V) e 
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Here, the Maxwell relation (3.2.10) was used. Thus we find for the heat 
capacities 


TVa? 
KT , 
With KT Cp —_— KT Cy = TV a? and KT Cy = KS Ce, it follows that the 
compressibilities obey the relation 
TVa? 
Gp? 
It follows from (3.2.24) that the two heat capacities can become equal only 


when the coefficient of expansion a vanishes or kr becomes very large. The 
former occurs in the case of water at 4°C. 


Cp=0y = 


(3.2.24) 


(3.2.25) 


KT — hs = 
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(ii) We now evaluate the thermodynamic derivatives for the classical ideal 
gas, based on Sect. 2.7 . For the enthalpy H = E+ PV, it follows from 
Eqns. (2.7.25) and (2.7.28) that 


5 


A= gNkT : (3.2.26) 
Then, for the heat capacities, we find 
OE 3 OH 5 
=({—] =-=Nk =(—~—) =-Nk; 2.2 
Cy (=), Nk, Cp (Fr). NR; (3.2.27) 
and for the compressibilities, 
1 (OV 1 Cy 3 
=> — => SS = 2.2 
ae Ta (55). pe Se Gp  5P° e228) 


finally, for the thermal expansion coefficient and the thermal pressure coeffi- 
cient, we find 


a= 5 (Fr) Sos 6=5(F) fe = *. Gone) 
i Vv 


V T P 


3.3 Fluctuations and Thermodynamic Inequalities 


This section is concerned with fluctuations of the energy and the particle 
number, and belongs contextually to the preceding chapter. We are only now 
treating these phenomena because the final results are expressed in terms of 
thermodynamic derivatives, whose definitions and properties are only now at 
our disposal. 


3.3.1 Fluctuations 


1. We consider a canonical ensemble, characterized by the temperature T, 
the volume V, the fixed particle number N, and the density matrix 


Z=Tre 8! . 


The average value of the energy [Eq. (2.6.37)] is given by 
= 1 1 OZ 
B= sTre *H= =~. ach 
give ZOD) (3.3.1) 
Taking the temperature derivative of (3.3.1), 


OE 1! JE 1 2 2 1 3 
(Gr), kT? O(—B) kT? [(H") — (H)"] = ppl Ak) 
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we obtain after substitution of (3.2.2a) the following relation between the 
specific heat at constant volume and the mean square deviation of the internal 
energy: 
1 
C 


Vv = ppa(Ak)’. (3.3.2) 


2. Next, we start with the grand canonical ensemble, characterized by 
T, V, pu, and the density matrix 


pa= Zee Ey) Zo =Tt e BUA- EN) | 


The average particle number is given by 


_ G 
N=Tr poN =kT ee am (3.3.3) 
Lb 


Its derivative with respect to the chemical potential is 


ON : 

( Op = B((N?) — N?) = B(AN)?. 
H/ Tv 

If we replace the left side by (3.2.12), we obtain the following relation between 

the isothermal compressibility and the mean square deviation of the particle 

number: 


aE fOVy .VefON\ ; 
= (FE) ay =H Gin ny 7 RON fond 


Eqns. (3.3.2) and (3.3.4) are fundamental examples of relations between sus- 
ceptibilities (on the left-hand sides) and fluctuations, so called fluctuation- 
response theorems. 


3.3.2 Inequalities 


From the relations derived in 3.3.1, we derive (as a result of the positivity of 
the fluctuations) the following inequalities: 


Kr 20, (3.3.5) 


Cp>Cy>0. (3.3.6) 


In (3.3.6), we have used the fact that according to (3.2.24) and (3.3.5), Cp is 
larger than Cy. On decreasing the volume, the pressure increases. On increas- 
ing the energy, the temperature increases. The validity of these inequalities 
is a precondition for the stability of matter. If, for example, (3.3.5) were not 
valid, compression of the system would decrease its pressure; it would thus 
be further compressed and would finally collapse. 
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3.4 Absolute Temperature and Empirical Temperatures 


OE 
Experimentally, one uses a temperature v, which is for example given by the 
length of a rod or a column of mercury, or the volume or the pressure of 
a gas thermometer. We assume that the empirical temperature J increases 
monotonically with T, i.e. that @ also increases when we put heat into the 
system. We now seek a method of determining the absolute temperature 
from U, that is, we seek the relation T = T(#). To this end, we start with the 


Rey . 5Q 
thermodynamic difference quotient (33) - 


6Q\ __/as aVv\ —  (av\ ao 
(3B) = 7 (Se), = (ar) = a) ae ee 


The absolute temperature was defined in (2.4.4) as T~! = (Ao) : 
VN 


Here, we have substituted in turn 0Q = TdS, the Maxwell relation (3.2.10), 
and T = T(v). It follows that 


rit ~ a - (3), (33), ; (3.4.2) 
T 


This expression is valid for any substance. The right-hand side can be mea- 
sured experimentally and yields a function of 0. Therefore, (3.4.2) represents 
an ordinary inhomogeneous differential equation for T(), whose integration 
yields 


T =const-: f(v) . (3.4.3) 


We thus obtain a unique relation between the empirical temperature J and 
the absolute temperature. The constant can be chosen freely due to the ar- 
bitrary nature of the empirical temperature scale. The absolute temperature 
scale is determined by defining the triple point of water to be T; = 273.16 K. 


For magnetic thermometers, it follows from (28)... aed i (3), ae bs (ou ) eS 
(cf. Chap. 6), analogously, 
1 dT OM dB 
Ta (Sr) Ga), (3.4.4) 
The absolute temperature 
as\~* 
T= (3) (3.4.5) 
OE) yn 


is positive, since the number of accessible states (« 2(F)) is a rapidly increas- 
ing function of the energy. The minimum value of the absolute temperature 
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is T = 0 (except for systems which have energetic upper bounds, such as an 
assembly of paramagnetic spins). This follows from the distribution of energy 
levels FE in the neighborhood of the ground-state energy Eo. We can see from 
the models which we have already evaluated explicitly (quantum-mechanical 
harmonic oscillators, paramagnetic moments: Sects. 2.5.2.1 and 2.5.2.2) that 
limg—.z, S’(Z) = co, and thus for these systems, which are generic with 
respect to their low-lying energy levels, 
lim T=0. 
E-Eo 

We return once more to the determination of the temperature scale through 
Eq. (3.4.3) in terms of T;= 273.16 K. As mentioned in Sect. 2.3, the value of 
the Boltzmann constant is also fixed by this relation. In order to see this, we 
consider a system whose equation of state at T; is known. Molecular hydrogen 
can be treated as an ideal gas at T; and P = 1 atm. The density of Hz under 
these conditions is 


p = 8.989 x 1072g/liter = 8.989 x 107-°g/cm™? . 


Its molar volume then has the value 
= 2.016 g 
8.989 x 10-2 g liters~! 


Vu = 22.414 liters . 

One mole is defined as: 1 mole corresponds to a mass equal to the atomic 
weight in g (e.g. a mole of Hz has a mass of 2.016 g). From this fact, we can 
determine the Boltzmann constant: 


PV 1 atm Viz 16 
Wr Ne ome ee 
= 1.38066 x 10773. /K . (3.4.6) 


Here, Avogadro’s number was used: 


Na, = number of molecules per mole 
2.016 g _ 2.016 g 
mass of Hp =. 2 x: 1.6734 x 10-24 


= 6.0221 x 1072 mol”! . 


Further definitions of units and constants, e.g. the gas constant R, are given in 
Appendix I. 


3.5 Thermodynamic Processes 


In this section, we want to treat thermodynamic processes, i.e. processes 
which either during the whole course of their time development or at least 
in their initial or final stages can be sufficiently well described by thermody- 
namics. 
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3.5.1 Thermodynamic Concepts 


We begin by introducing several concepts of thermodynamics which we will 
later use repeatedly (cf. Table 3.2). 

Processes in which the pressure is held constant, i.e. P = const, are 
called isobaric; those in which the volume remains constant, V = const, are 
isochoral; those in which the entropy is constant, S = const, are isentropic; 
and those in which no heat is transferred, i.e. 6Q = 0, are termed adiabatic 
(thermally isolated). 


Table 3.2. Some thermodynamic concepts 


Concept Definition 


isobaric P=const. 

isochoral V = const. 

isothermal T= const. 

isentropic S = const. 

adiabatic 6Q =0 

extensive proportional to the size of the system 


intensive independent of the size of the system 


We mention here another definition of the terms extensive and intensive, 
which is equivalent to the one given in the section on the Gibbs—Duhem 
relation. We divide a system that is characterized by the thermodynamic 
variable Y into two parts, which are themselves characterized by Y; and Y. 
In the case that Y; + Yo = Y, Y is called extensive; when Y; = Y2 = Y, it is 
termed intensive (see Fig. 3.3). 


extensive: 


intensive: Fig. 3.3. The definition of extensive and in- 
tensive thermodynamic variables 


Extensive variable include: V, N, E, S, the thermodynamic potentials, the 
electric polarization P, and the magnetization M. 
Intensive variables include: P, p, T, the electric field E, and the magnetic 
field B. 

Quasistatic process: a quasistatic process takes place slowly with respect 
to the characteristic relaxation time of the system, i.e. the time within which 
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the system passes from a nonequilibrium state to an equilibrium state, so that 
the system remains in equilibrium at each moment during such a process. 
Typical relaxation times are of the order of tT = 10~!° — 107° sec. 

An irreversible process is one which cannot take place in the reverse direc- 
tion, e.g. the transition from a nonequilibrium state to an equilibrium state 
(the initial state could also be derived from an equilibrium state with restric- 
tions by lifting of those restrictions). Experience shows that a system which 
is not in equilibrium moves towards equilibrium; in this process, its entropy 
increases. The system then remains in equilibrium and does not return to the 
nonequilibrium state. 

Reversible processes: reversible processes are those which can also occur in 
the reverse direction. An essential attribute of reversibility is that a process 
which takes place in a certain direction can be followed by the reverse process 
in such a manner that no changes in the surroundings remain. 

The characterization of a thermodynamic state (with a fixed particle num- 
ber NV) can be accomplished by specifying two quantities, e.g. T and V, or P 
and V. The remaining quantities can be found from the thermal and the 
caloric equations of state. A system in which a quasistatic process is occur- 
ring, i.e. which is in thermal equilibrium at each moment in time, can be 
represented by a curve, for example in a P-V diagram (Fig. 2.7b). 

A reversible process must in all cases be quasistatic. In non-quasistatic 
processes, turbulent flows and temperature fluctuations take place, leading 
to the irreversible production of heat. The intermediate states in a non- 
quasistatic process can furthermore not be sufficiently characterized by P 
and V. One requires for their characterization more degrees of freedom, or in 
other words, a space of higher dimensionality. 

There are also quasistatic processes which are irreversibe (e.g. temper- 
ature equalization via a poor heat conductor, 3.6.3.1; or a Gay-Lussac ex- 
periment carried out slowly, 3.6.3.6). Even in such processes, equilibrium 
thermodynamics is valid for the individual components of the system. 


Remark: 


We note that thermodynamics rests on equilibrium statistical mechanics. 
In reversible processes, the course of events is so slow that the system is 
in equilibrium at each moment; in irreversible processes, this is true of at 
least the initial and final states, and thermodynamics can be applied to these 
states. In the following sections, we will clarify the concepts just introduced 
on the basis of some typical examples. In particular, we will investigate how 
the entropy changes during the course of a process. 
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3.5.2 The Irreversible Expansion of a Gas; the Gay-Lussac 
Experiment (1807) 


The Gay-Lussac experiment* deals with the adiabatic expansion of a gas and 
is carried out as follows: a container of volume V which is insulated from its 
surroundings is divided by partition into two subvolumes, V; and V3. Initially, 
the volume V, contains a gas at a temperature T’, while V2 is evacuated. The 
partition is then removed and the gas flows rapidly into V2 (Fig. 3.4). 


Fig. 3.4. The Gay-Lussac experiment 


After the gas has reached equilibrium in the whole volume V = V; + Va, its 
thermodynamic quantities are determined. 

We first assume that this experiment is carried out using an ideal gas. The 
initial state is completely characterized by its volume V; and the temperature 
T. The entropy and the pressure before the expansion are, from (2.7.27) and 
(2.7.25), given by 


4) Vi/N NkT 
$= Nk (3 + oe | and P= Wo? 


with the thermal wavelength ): 


h 
V2amkT 


In the final state, the volume is now V = V, + Vg. The temperature is still 
equal to T, since the energy remains constant and the caloric equation of 
state of ideal gases, EF = 3kTN , contains no dependence on the volume. The 
entropy and the pressure after the expansion are: 


5. V/N NK 
'=Nk(241 pee, 
e (5 + les ) V 


We can see that in this process, there is an entropy production of 
4 
AS = S'—S = Nklog = > 0. (3.5.1) 
1 


* Louis Joseph Gay-Lussac, 1778-1850. The goal of Gay-Lussac’s experiments was 
to determine the volume dependence of the internal energy of gases. 
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It is intuitively clear that the process is irreversible. Since the entropy in- 
creases and no heat is transferred, (5Q = 0), the mathematical criterion for 
an irreversible process, Eq. (3.6.8) (which remains to be proved), is fulfilled. 
The initial and final states in the Gay-Lussac experiment are equilibrium 
states and can be treated with equilibrium thermodynamics. The interme- 
diate states are in general not equilibrium states, and equilibrium thermo- 
dynamics can therefore make no statements about them. Only when the 
expansion is carried out as a quasistatic process can equilibrium thermody- 
namics be applied at each moment. This would be the case if the expansion 
were carried out by allowing a piston to move slowly (either by moving a 
frictionless piston in a series of small steps without performing work, or by 
slowing the expansion of the gas by means of the friction of the piston and 
transferring the resulting frictional heat back into the gas). 

For an arbitrary isolated gas, the temperature change per unit volume at 
constant energy is given by 


Gr ey - “Gee? _ 2 (p (FF), ) > 528) 


where the Maxwell relation (23), a (3), has been employed. This co- 


efficient has the value 0 for an ideal gas, but for real gases it can have 
either a positive or a negative sign. The entropy production is, owing 
to dE = TdS — PdV = 0, given by 


Os P 
— == 3.5.2b 
(sr). Po, (3.5.2) 


i.e. dS > 0. Furthermore, no heat is exchanged with the surroundings, that 
is, 6Q = 0. Therefore, it follows that the inequality between the change in 
the entropy and the quantity of heat transferred 


TdS > 6Q (3.5.3) 


holds here. 

The coefficients calculated from equilibrium thermodynamics (3.5.2a,b) can 
be applied to the whole course of the Gay-Lussac experiment if the process 
is carried out in a quasistatic manner. Yet it remains an irreversible process! 
By integration of (3.5.2a,b), one obtains the differences in temperature and 
entropy between the final and initial states. The result can by the way also be 
applied to the non-quasistatic irreversible process, since the two final states 
are identical. We shall return to the quasistatic, irreversible Gay-Lussac ex- 
periment in 3.6.3.6. 
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3.5.3 The Statistical Foundation of Irreversibility 


How irreversible is the Gay-Lussac process? In order to understand why the 
Gay-Lussac experiment is irreversible, we consider the case that the volume 
increase dV fulfills the inequality 6bV < V, where V now means the initial 
volume (see Fig. 3.5). 


Fig. 3.5. Illustration of the Gay-Lussac ex- 
periment 


In the expansion from V to V + 6V, the phase-space surface changes from 
2(E,V) to Q(E,V + 6V), and therefore the entropy changes from S(F,V) 
to S(E,V + 6V). After the gas has carried out this expansion, we ask what 
the probability would be of finding the system in only the subvolume V. 
Employing (1.3.2), (2.2.4), and (2.3.4), we find this probability to be given 
by 
dqdp 0o(H-E) 2(E,V) 


~ J NIRBN Q(E,V +6V) Q(E,V +6V) sed) 
4 


W(E,V) 


= e- (S(E,V+5V)-S(E,V))/k _ 


(2s P bV. 
=e (37) g9V/* _ see er Ne 


where in the last rearrangement, we have assumed an ideal gas. Due to the 
factor N ~ 1078 in the exponent, the probability that the system will return 
spontaneously to the volume V is vanishingly small. 

In general, it is found that for the probability, a constraint (a restriction 
C) occurs spontaneously: 


W(E,C) =e“ CP)-SEey/F (3.5.5) 


We find that S(E,C) < S(E), since under the constraint, fewer states are 
accessible. The difference S(E£) — S(E£,C) is macroscopic; in the case of 
the change in volume, it was proportional to NéV/V, and the probability 
W(E,C) ~ e~% is thus practically zero. The transition from a state with a 
constraint C to one without this restriction is irreversible, since the probabil- 
ity that the system will spontaneously search out a state with this constraint 
is vanishingly small. 
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3.5.4 Reversible Processes 


In the first subsection, we consider the reversible isothermal and adiabatic 
expansion of ideal gases, which illustrate the concept of reversibility and are 
important in their own right as elements of thermodynamic processes. 


3.5.4.1 Typical Examples: the Reversible Expansion of a Gas 


In the reversible expansion of an ideal gas, work is performed on a spring by 
the expanding gas and energy is stored in the spring (Fig. 3.6). This energy 
can later be used to compress the gas again; the process is thus reversible. It 
can be seen as a reversible variation of the Gay-Lussac experiment. Such a 
process can be carried out isothermally or adiabatically. 


Daaaaa 


Fig. 3.6. The reversible isothermal expansion of a 
gas, where the work performed is stored by a spring. 
The work performed by the gas is equal to the area 
below the isotherm in the P — V diagram. 


a) Isothermal Expansion of a Gas, T = const. 

We first consider the isothermal expansion. Here, the gas container is in a 
heat bath at a temperature JT. On expansion from the initial volume Vj, to 
the final volume V, the gas performs the work:° 


Vv 4 
NkT 
w= [rw = fu NET log. (3.5.6) 
v VY 
Vi Vi 


This work can be visualized as the area below the isotherm in the P — V 
diagram (Fig. 3.6). Since the temperature remains constant, the energy of 
the ideal gas is also unchanged. Therefore, the heat bath must transfer a 
quantity of heat 


° We distinguish the work performed by the system (W) from work performed on 
the system (W), we use different symbols, implying opposite signs: WW = —W. 
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Q=W (3.5.7) 


to the system. The change in the entropy during this isothermal expansion 
is given according to (2.7.27) by: 


AS = Nese —. (3.5.8) 
Vi 


Comparison of (3.5.6) with (3.5.8) shows us that the entropy increase and 
the quantity of heat taken up by the system here obey the following relation: 
Q 

AS = i (3.5.9) 
This process is reversible, since using the energy stored in the spring, one 
could compress the gas back to its original volume. In this compression, the 
gas would release the quantity of heat Q to the heat bath. The final state of 
the system and its surroundings would then again be identical to their original 
state. In order for the process to occur in a quasistatic way, the strength of 
the spring must be varied during the expansion or compression in such a 
way that it exactly compensates the gas pressure P (see the discussion in 
Sect. 3.5.4.2). One could imagine the storage and release of the energy from 
the work of compression or expansion in an idealized thought experiment to 
be carried out by the horizontal displacement of small weights, which would 
cost no energy. 

We return again to the example of the irreversible expansion (Sect. 3.5.2). 
Clearly, by performing work in this case we could also compress the gas after 
its expansion back to its original volume, but then we would increase its 
energy in the process. The work required for this compression is finite and 
its magnitude is proportional to the change in volume; it cannot, in contrast 
to the case of reversible processes, in principle be made equal to zero. 


b) Adiabatic Expansion of a Gas, AQ = 0 

We now turn to the adiabatic reversible expansion. In contrast to Fig. 3.6, 
the gas container is now insulated from its surroundings, and the curves in 
the P-V diagram are steeper. In every step of the process, 6Q = 0, and 
since work is here also performed by the gas on its surroundings, it cools on 
expansion. It then follows from the First Law that 


dE = —Pdv . 


If we insert the caloric and the thermal equations of state into this equation, 
we find: 


cae Lae (3.5.10) 


Integration of the last equation leads to the two forms of the adiabatic equa- 
tion for an ideal gas: 
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T=T(Vi/v)" and P=Nk VZV , (3.5.11a,b) 


where the equation of state was again used to obtain b. 

We now once more determine the work W(V) performed on expansion 
from V, to V. It is clearly less than in the case of the isothermal expansion, 
since no heat is transferred from the surroundings. Correspondingly, the area 
beneath the adiabats is smaller than that beneath the isotherms (cf. Fig. 3.7). 
Inserting Eq. (3.5.11b) yields for the work: 


y isotherm 
adiabat 


Fig. 3.7. An isotherm and an adiabat pass- 
ing through the initial point (P:,Vi), with P; = 
Vy Vv NET /Vi 


Vv) = | wv p=Syen(i-(%)“"), 53 


geometrically, this is the area beneath the adiabats, Fig. 3.7. The change in 
the entropy is given by 


3 
AS = Nklog (4 4 =0, (3.5.13) 
1 


and it is equal to zero. We are dealing here with a reversible process in 
an isolated systems, (AQ = 0), and find AS = 0, ie. the entropy remains 
unchanged. This is not surprising, since for each infinitesimal step in the 
process, 


TdS = 6Q =0 (3.5.14) 


holds. 


*3.5.4.2 General Considerations of Real, Reversible Processes 


We wish to consider to what extent the situation of a reversible process can 
indeed be realized in practice. If the process can occur in both directions, 
what decides in which direction it in fact proceeds? To answer this question, 
in Fig. 3.8 we consider a process which takes place between the points 1 
and 2. 
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PRPLOCOARS > 
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Fig. 3.8. A reversible process. P is the internal pressure of the system (solid line). 
P, is the external pressure produced by the spring (dashed line). 


The solid curve can be an isotherm or a polytrope (i.e. an equilibrium curve 
which lies between isotherms and adiabats). Along the path from 1 to 2, the 
working substance expands, and from 2 to 1, is is compressed again, back to 
its initial state 1 without leaving any change in the surroundings. At each 
moment, the pressure within the working substance is precisely compensated 
by the external pressure (produced here by a spring). 

This quasistatic reversible process is, of course, an idealization. In order 
for the expansion to occur at all, the external pressure P™* must be some- 
what lower than P during the expansion phase of the process. The external 
pressure is indicated in Fig. 3.8 by the dashed curve. This curve, which is 
supposed to characterize the real course of the process, is drawn in Fig. 3.8 
as a dashed line, to indicate that a curve in the P — V diagram cannot fully 
characterize the system. In the expansion phase with P, < P, the gas near 
the piston is somewhat rarefied. This effectively reduces its pressure and the 
work performed by the gas is slightly less than would correspond to its actual 
pressure. Density gradients occur, i.e. there is a non-equilibrium state. The 
work obtained (which is stored as potential energy in the spring), [- : dV. PE, 
then obeys the inequality 


2 
2 2 
[wv pe < | dV P ay dV Poe, (3.5.15) 
1 1 
1 


For the compression, we must have P©°™ > P. On returning to point 1, the 
work — dV P, = fi dV P, (which is equal to the area enclosed by the dashed 
curve) is performed. This work is given up to the heat bath in the form of a 
heat loss AQ_. [Frictional losses; turbulent motions when the process is too 
rapid, which also produce heat.] 


2 1 
=f rav>( {rave fpav) =o. (3.5.16) 
1 2 


The inequality results from the fact that P, 2 P, that is, the gas and the 
spring are not in equilibrium. On returning to point 1, the entropy is again 
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equal to the initial entropy, that is the change in the entropy AS = 0. There- 
fore, the preceding inequality can also be written in the form 


AO =AOe = TAS (3.5.17) 


where AQ is the quantity of heat taken up by the system (which is negative). 
These irreversible losses can in principle be made arbitrarily small by moving 
the piston very slowly. The reversible process is the ideal limiting case of 
extreme slowness. 

Analogously, for processes with heat transfer, small temperature differ- 
ences must be present. In order for the heat bath to give up heat to the 
system, it must be slightly warmer; in order for it to take on heat from the sys- 
tem, it must be slightly cooler. After a whole cycle has been passed through, 
heat will have been transferred from the warmer a to the cooler b (Fig. 3.9). 


> 
S Fig. 3.9. Heat transfer 


Strictly reversible processes are in fact not processes which proceed con- 
tinuously in time, but rather a sequence of equilibrium states. All processes 
which occur in practice as continuous variations with time are irreversible; 
they contain equilibration processes between perturbed equilibrium states. 
In spite of their unrealistic character in a strict sense, reversible processes 
play a major role in thermodynamics. While in thermodynamics, statements 
about irreversible processes can be made only in the form of inequalities 
which determine the direction of the process, for reversible processes one can 
make precise predictions, which can also be achieved in practice as limiting 
cases. To be sure, thermodynamics can also deliver precise predictions for 
irreversible processes, namely for the relation between their initial and final 
states, as we have seen for the case of the irreversible adiabatic expansion. 


3.5.5 The Adiabatic Equation 


Here, we want to first discuss generally the adiabatic equation and then to 
apply it to ideal gases. We start from Eq. (3.2.23), 


OP\ _ Cp (dP 
(ai) Ge ae) sea 
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and define the ratio of the specific heats: 


=o: (3.5.19) 


K 
According to (3.3.6), « > 1, and therefore for every substance, the slope of 
the adiabats, P = P(V,S = const.), is steeper than that of the isotherms, 
P= P(V,T = const.). 

For a classical ideal gas, we find «k=const.® and ($5)- = Ae = Lc ; 
It thus follows from (3.5.18) 


(FF) = age (3.5.20) 


The solution of this differential equation is 


PV* = const , 
and with the aid of the equation of state, we then find 


TV*—' = const . (3.5.21) 


For a monatomic ideal gas, we have k = = 3, where we have made use 


of (3.2.27). 


3.6 The First and Second Laws of Thermodynamics 


3.6.1 The First and the Second Law for Reversible and 
Irreversible Processes 


3.6.1.1 Quasistatic and in Particular Reversible Processes 


We recall the formulation of the First and Second Laws of Thermodynamics 
in Eqns. (3.1.3) and (3.1.5). In the case of reversible transitions between an 
equilibrium state and a neighboring, infinitesimally close equilibrium state, 
we have 


dE = 6Q — PdV + pdN (3.6.1) 
with 
6Q=TdS. (3.6.2) 


® This is evident for a monatomic classical ideal gas from (3.2.27). For a molecular 
ideal gas as treated in Chap. 5, the specific heats are temperature independent 
only in those temperature regions where particular internal degrees of freedom 
are completely excited or not excited at all. 
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Equations (3.6.1) and (3.6.2) are the mathematical formulations of the First 
and Second Laws. The Second Law in the form of Eq. (3.6.2) holds for re- 
versible (and thus necessarily quasistatic) processes. It is also valid for qua- 
sistatic irreversible processes within those subsystems which are in equilib- 
rium at every instant in time and in which only quasistatic transitions from an 
equilibrium state to a neighboring equilibrium state take place. (An example 
of this is the thermal equilibration of two bodies via a poor heat conductor 
(see Sect. 3.6.3.1). The overall system is not in equilibrium, and the process 
is irreversible. However, the equilibration takes place so slowly that the two 
bodies within themselves are in equilibrium states at every moment in time). 


3.6.1.2 Irreversible Processes 
For arbitrary processes, the First Law holds in the form given in Eq. (3.1.3’): 
dE =6Q+6W +6En , (3.6.1’) 


where 6Q, dW, and 6Ey are the quantity of heat transferred, the work 
performed on the system, and the increase in energy through addition of 
matter. 

In order to formulate the Second Law with complete generality, we re- 
call the relation (2.3.4) for the entropy of the microcanonical ensemble and 
consider the following situation: we start with two systems 1 and 2 which 
are initially separated and are thus not in equilibrium with each other; their 
entropies are S; and Sz. We now bring these two systems into contact. The 
entropy of this nonequilibrium state is 


Sinitial = S1 + So. (3.6.3) 


Suppose the two systems to be insulated from their environment and their 
total energy, volume, and particle number to be given by FE, V and N. Now the 
overall system passes into the microcanonical equilibrium state corresponding 
to these macroscopic values. Owing to the additivity of entropy, the total 
entropy after equilibrium has been reached is given by 


Si42(E,V,N) = $1(F,Vi, M1) + So(E2, Vo, No) , (3.6.4) 


where E,, VY, (Ea, V2, No) are the most probable values of these quan- 
tities in the subsystem 1 (2). Since the equilibrium entropy is a maximum 
(Eq. 2.3.5), the following inequality holds: 
Si + So = Sinitial (3.6.5) 
<S140(E, V, N) = S1(F1,Vi, N1) a So(E2, V2, No) ‘ 


Whenever the initial density matrix of the combined systems 1+2 is not 
already equal to the microcanonical density matrix, the inequality sign holds. 
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We now apply the inequality (3.6.5) to various physical situations. 


(A) Let an isolated system be in a non-equilibrium state. We can decom- 
pose it into subsystems which are in equilibrium within themselves and apply 
the inequality (3.6.5). Then we find for the change AS in the total entropy 


AS>0. (3.6.6) 


This inequality expresses the fact that the entropy of an isolated systems can 
only increase and is also termed the Clausius principle. 

(B) We consider two systems 1 and 2 which are in equilibrium within 
themselves but are not in equilibrium with each other. Let their entropy 
changes be denoted by AS) and AS. From the inequality (3.6.5), it follows 
that 


We now assume that system 2 is a heat bath, which is large compared to 
system 1 and which remains at the temperature T throughout the process. 
The quantity of heat transferred to system 1 is denoted by AQ. For system 2, 
the process occurs quasistatically, so that its entropy change AS» is related 
to the heat transferred, -AQ,, by 


1 
AS» ane —7pAQi . 
Inserting this into Eq. (3.6.7), we find 
1 
AS) > pag : (3.6.8) 


In all the preceding relations, the quantities AS and AQ are by no means 
required to be small, but instead represent simply the change in the entropy 
and the quantity of heat transferred. 

In the preceding discussion, we have considered the initial state and as fi- 
nal state a state of overall equilibrium. In fact, these inequalities hold also for 
portions of the relaxation process. Each intermediate step can be represented 
in terms of equilibrium states with constraints, whereby the limitations im- 
posed by the constraints decrease in the course of time. At the same time, 
the entropy increases. Thus, for each infinitesimal step in time, the change in 
entropy of the isolated overall system is given by 


dS >0. (3.6.6’) 


For the physical situation described under B, we have 


aS) > Z6Q1. (3.6.8) 
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We now summarize the content of the First and Second Laws. 


The First Law: 
dE = 6Q+6W +6En (3.6.9) 


Change of energy = heat transferred + work performed + energy change due 
to transfer of matter; FE is a state function. 


The Second Law: 
6Q<TdS (3.6.10) 


and S is a state function. 
a) For reversible changes: dQ = TdS. 
b) For irreversible changes: dQ < TdS. 


Notes: 


(i) The equals sign in Eq. (3.6.10) holds also for irreversible quasistatic pro- 
cesses in those subregions which are in equilibrium in each step of the process 
see Sect. 3.6.3.1). 

ii) In (3.6.10), we have combined (3.6.6’) and (3.6.8’). The situation of the iso- 
lated system (3.6.6) is included in (3.6.10), since in this case 6@ = 0 (see the 
example 3.6.3.1). 

iii) In many processes, the particle number remains constant (dN = 0). Therefore, 
we often employ (3.6.9) considering only 6Q and 6W, without mentioning this 
expressly each time. 


We now wish to apply the Second Law to a process which leads from a 
state A to a state B as indicated in Fig. 3.10. If we integrate (3.6.10), we 
obtain 


and from this, 
7 6 
Sp- Saf : (3.6.11) 
A 


For reversible processes, the equals sign holds; for irreversible ones, the in- 
equality. In a reversible process, the state of the system can be completely 
characterized at each moment in time by a point in the P—V-diagram. In 
an irreversible process leading from one equilibrium state (possibly with con- 
straints) A to another equilibrium state B, this is not in general the case. 
This is indicated by the dashed line in Fig. 3.10. 


3.6 The First and Second Laws of Thermodynamics 107 


V 
Fig. 3.10. The path of a process con- Fig. 3.11. A cyclic process, repre- 
necting two thermodynamic states A sented by a closed curve in the P— V- 
and B diagram, which leads back to the start- 


ing point (B = A), whereby at least 
to some extent irreversible changes of 
state occur. 


We consider the following special cases: 


(i) An adiabatic process: For an adiabatic process (5Q = 0), it follows from 
(3.6.11) that 


Sp>Sa or AS>O0. (3.6.11’) 


The entropy of a thermally isolated system cannot decrease. This state- 
ment is more general than Eq. (3.6.6), where completely isolated systems 
were assumed. 

(ii) Cyclic processes: For a cyclic process, the final state is identical with the 
initial state, B = A (Fig. 3.11). Then we have S'p = S'4 and and it follows 
from Eq. (3.6.11) for a cyclic process that the inequality 


0> ¢F o@ (3.6.12) 


holds, where the line integral ¢ is calculated along the closed curve of 
Fig. 3.11, corresponding to the actual direction of the process. 


*3.6.2 Historical Formulations of the Laws of Thermodynamics 
and other Remarks 


The First Law 


There exists no perpetual motion machine of the first kind (A perpetual mo- 
tion machine of the first kind refers to a machine which operates periodically 
and functions only as a source of energy). Energy is conserved and heat is only 
a particular form of energy, or more precisely, energy transfer. The recogni- 
tion of the fact that heat is only a form of energy and not a unique material 
which can penetrate all material bodies was the accomplishment of Julius 
Robert Mayer (a physician, 1814-1878) in 1842. 
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James Prescott Joule (a brewer of beer) carried out experiments in the years 
1843-1849 which demonstrated the equivalence of heat energy and the energy 
of work 


1 cal = 4.1840 x 10” erg = 4.1840 Joule . 
The First Law was mathematically formulated by Clausius: 
6Q=dE+PdV. 


The historical formulation quoted above follows from the First Law, which 
contains the conservation of energy and the statement that E is a state 
variable. Thus, if a machine has returned to its initial state, its energy must 
be the same as before and it can therefore not have given up any energy to 
its environment. 


Second Law 


Rudolf Clausius (1822-1888) in 1850 : Heat can never pass on its own from 
a cooler reservoir to a warmer one. 

William Thomson (Lord Kelvin, 1824-1907) in 1851: The impossibility of a 
perpetual motion machine of the second kind. (A perpetual motion machine 
of the second kind refers to a periodically operating machine, which only 
extracts heat from a single reservoir and performs work.) 

These formulations are equivalent to one another and to the mathematical 
formulation. 


Equivalent formulations of the Second Law. 


The existence of a perpetual motion machine of the second kind could be used to 
remove heat from a reservoir at the temperature 7). The resulting work could then 
be used to heat a second reservoir at the higher temperature T2. The correctness 
of Clausius’ statement thus implies the correctness of Kelvin’s statement. 

If heat could flow from a colder bath to a warmer one, then one could use 
this heat in a Carnot cycle (see Sect. 3.7.2) to perform work, whereby part of the 
heat would once again be taken up by the cooler bath. In this overall process, only 
heat would be extracted from the cooler bath and work would be performed. One 
would thus have a perpetual motion machine of the second kind. The correctness 
of Kelvin’s statement thus implies the correctness of Clausius’ statement. 

The two verbal formulations of the Second Law, that of Clausius and that of 
Kelvin, are thus equivalent. It remains to be demonstrated that Clausius’ statement 
is equivalent to the differential form of the Second Law (Eq. 3.6.10). To this end, 
we note that it will be shown in Sect. 3.6.3.1 from (3.6.10) that heat passes from a 
warmer reservoir to a cooler one. Clausius’ statement follows from (3.6.10). Now we 
must only demonstrate that the relation (3.6.10) follows from Clausius’ statement. 
This can be seen as follows: if instead of (3.6.10), conversely TdS < 6Q would 
hold, then it would follow form the consideration of the quasistatic temperature 
equilibration that heat would be transported from a cooler to a warmer bath; 
ie. that Clausius’ statement is false. The correctness of Clausius’ statement thus 
implies the correctness of the mathematical formulation of the Second Law (3.6.10). 
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All the formulations of the Second Law are equivalent. We have included these 
historical considerations here because precisely their verbal formulations show the 
connection to everyday consequences of the Second Law and because this type of 
reasoning is typical of thermodynamics. 


The Zeroth Law 


When two systems are in thermal equilibrium with a third system, then they 
are in equilibrium with one another. 

Proof within statistical mechanics: 

Systems 1, 2, and 3. Equilibrium of 1 with 3 implies that 7, = T3 and that 
of 2 with 3 that T> = 73; it follows from this that T; = Tb, i.e. 1 and 2 are 
also in equilibrium with one another. The considerations for the pressure and 
the chemical potential are exactly analogous. 

This fact is of course very important in practice, since it makes it possible to 
determine with the aid of thermometers and manometers whether two bodies 
are at the same temperature and pressure and will remain in equilibrium or 
not if they are brought into contact. 


The Third Law 


The Third Law (also called Nernst’s theorem) makes statements about the 
temperature dependence of thermodynamic quantities in the limit T’ — 0; it 
is discussed in the Appendix A.1. Its consequences are not as far-reaching as 
those of the First and Second Laws. The vanishing of specific heats as T — 0 
is a direct result of quantum mechanics. In this sense, its postulation in the 
era of classical physics can be regarded as visionary. 


3.6.3 Examples and Supplements to the Second Law 


We now give a series of examples which clarify the preceding concepts and 
general results, and which have also practical significance. 


3.6.3.1 Quasistatic Temperature Equilibration 


We consider two bodies at the temperatures T, and T> and with entropies S; 
and Sg. These two bodies are connected by a poor thermal conductor and 
are insulated from their environment (Fig. 3.12). The two temperatures are 
different: T, 4 T>; thus, the two bodies are not in equilibrium with each other. 
Since the thermal conductor has a poor conductivity, all energy transfers 
occur slowly and each subsystem is in thermal equilibrium at each moment 
in time. Therefore, for a heat input dQ to body 1 and thus the equal but 
opposite heat transfer —dQ from body 2, the Second Law applies to both 
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subsystems in the form 


d=, dS=-—. (3.6.13) 


T T Fig. 3.12. Quasistatic temperature equi- 
1 2 libration of two bodies connected by a 


poor conductor of heat 


For the overall system, we have 
dS; +dSo>0, (3.6.14) 


since the total entropy increases during the transition to the equilibrium 
state. If we insert (3.6.13) into (3.6.14), we obtain 


5Q (= = z) >0. (3.6.15) 


We take T2 > T;; then it follows from (3.6.13) that dQ > 0, i.e. heat is trans- 
ferred from the warmer to the cooler container. We consider here the differ- 
ential substeps, since the temperatures change in the course of the process. 
The transfer of heat continues until the two temperatures have equalized; the 
total amount of heat transferred from 2 to 1, [ dQ, is positive. 

Also in the case of a non-quasistatic temperature equilibration, heat is 
transferred from the warmer to the cooler body: if the two bodies mentioned 
above are brought into contact (again, of course, isolated from their envi- 
ronment, but without the barrier of a poor heat conductor), the final state 
is the same as in the case of the quasistatic process. Thus also in the non- 
quasistatic temperature equilibration, heat has passed from the warmer to 
the cooler body. 


3.6.3.2 The Joule~—Thomson Process 


The Joule-Thomson process consists of the controlled expansion of a gas 
(cf. Fig. 3.13). Here, the stream of expanding gas is limited by a throttle 
valve. The gas volume is bounded to the left and the right of the throttle by 
the two sliding pistons S; and S2, which produce the pressures P,; and P2 in 
the left and right chambers, with P; > P). The process is assumed to occur 
adiabatically, i.e. 6Q = 0 during the entire process. 

In the initial state (1), the gas in the left-hand chamber has the volume 
V, and the energy F}. In the final state, the gas is entirely in the right-hand 
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Fig. 3.13. A Joule-Thomson pro- 
cess, showing the sliding pistons 51 
and Seg and the throttle valve T 


chamber and has a volume V2 and energy E2. The left piston performs work 
on the gas, while the gas performs work on the right piston and thus on the 
environment. The difference of the internal energies is equal to the total work 
performed on the system: 


2 2 0 V2 
En — Ey poe pow fucro+ fav 
1 1 Vi 0 


= PV, — Pov. . 


From this it follows that the enthalpy remains constant in the course of this 
process: 


Hp =H, , (3.6.16) 


where the definition H; = E; + P;V; was used. 

For cryogenic engineering it is important to know whether the gas is 
cooled by the controlled expansion. This is determined by the Joule-Thomson 
coefficient: 


(35) = (SP) 7 T(3p)e+V _T(ar)p-V ; 
ua (3r)p T (30) p OP 
In the rearrangement, we have used (3.2.21), dH = TdS + VdP, and the 


Maxwell relation (3.2.10). Inserting the thermal expansion coefficient a, we 
find the following expression for the Joule-Thomson coefficient: 


OT V 
— = —(Ta-1). 3.6.17 
(55) - ate (3.6.17) 
For an ideal gas, a = ae in this case, there is no change in the temperature 


on expansion. For a real gas, either cooling or warming can occur. When 
a> aa the expansion leads to a cooling of the gas (positive Joule-Thomson 
effect). When a < on then the expansion gives rise to a warming (negative 
Joule-Thomson effect). The limit between these two effects is defined by the 
inversion curve, which is given by 


(3.6.18) 


ey 
=F: 
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We shall now calculate the inversion curve for a van der Waals gas, beginning 
with the van der Waals equation of state (Chap. 5) 


kT 
P= ss eee (3.6.19) 


v—b v 
We differentiate the equation of state with respect to temperature at constant 
pressure 


j= k kT Ov ined Ov 
—u-b (v—b)? \OT/, wv \OT/] >,’ 


In this expression, we insert the condition (3.6.18) 


_ 1 /fodv i 
eines i) eae 


for ($4) , and thereby obtain 0 = £ — *, + 2¢4(v—b). Using the van-der- 


v vs T 
Waals equation again, we finally find for the inversion curve 


v ye vs 
that is 
2a 3a 


In the limit of low density, we can neglect the second term in (3.6.20) and 
the inversion curve is then given by 
2a kk Tiny 2a 
= = Ting = = = 6.15 T,. 3.6.21 
bu v , me bk ( ) 
Here, T, is the critical temperature which follows from the van der Waals 
equation (5.4.13). For temperatures which are higher than the inversion tem- 
perature Tjny, the Joule-Thomson effect is always negative. The inversion 
temperature and other data for some gases are listed in Table [.4 in the 
Appendix. 
The change in entropy in the Joule-Thomson process is determined by 


Os 4 


as can be seen using dH = TdS + VdP = 0. Since the pressure decreases, we 
obtain for the entropy change dS > 0, although 6Q = 0. The Joule-Thomson 
process is irreversible, since its initial state with differing pressures in the 
two chambers is clearly not an equilibrium state. 

The complete inversion curve from the van der Waals theory is shown in 
Fig. 3.14a,b. Within the inversion curve, the expansion leads to cooling of 
the gas. 
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Pe wade 

t 3b Sb 10b 15b = 20b V 

? (a) (b) 
(a) The inversion curve for the Joule— (b) The inversion curve in the P-T 
Thomson effect (upper solid curve). diagram. 


The isotherm is for T = 6.75 T. (dot- 
dashed curve). The shaded region is 
excluded, since in this region, the va- 
por and liquid phases are always both 
present. 


Fig. 3.14. The inversion curve for the Joule-Thomson effect 


3.6.3.3 Temperature Equilibration of Ideal Gases 


We will now investigate the thermal equilibration of two monatomic ideal 
gases (a and b). Suppose the two gases to be separated by a sliding piston 
and insulated from their environment (Fig. 3.15). 


Fig. 3.15. The thermal equilibration of two 
ideal gases 


The pressure of the two gases is taken to be equal, P, = P, = P, while 
their temperatures are different in the initial state, T, 4 Ty. Their volumes 
and particle numbers are given by V,, Vj and Na, No, so that the total volume 
and total particle number are V = V, + Vy and N = N, + Np. The entropy 
of the initial state is given by 


5 | Va | 5 | Vo 
S=S45,=k{ N, € | los | + Np (5 | lor xa) } . (3.6.23) 
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The temperature after the establishment of equilibrium, when the tempera- 
tures of the two systems must approach the same value according to Chap. 2, 
will be denoted by T. 

Owing to the conservation of energy, we have 3N kT = 3 NakTat+ 3.NpkTp, 
from which it follows that 


NaTa + NoTo 
T= = cay Le 3.6.24 
N, +N; c + colo ( ) 
where we have introduced the ratio of the particle numbers, Cap = eae We 
recall the definition of the thermal wavelengths 
h / h 
ON a ——————— eb =. 
\/27Ma,bkT a,b i \/27Me,pkT 
The entropy after the establishment of equilibrium is 
5 V; 5 Vv 
t= | 
S = kN {3 log “es kN {54 t 108 a is} ‘ 
so that for the entropy increase, we find 
; es : VAP 
S'—S=kN, log 4 V, Vi + kN» log V, NB (3.6.25) 


We shall also show that the pressure remains unchanged. To this end, we 
add the two equations of state of the subsystems before the establishment of 
thermal equilibrium 


VaP =NgkTa,  VoP = NykTh (3.6.26) 
and obtain using (3.6.24) the expression 
(Va + Vb) P = (Na + No)kT . (3.6.26b) 


From the equations of state of the two subsystems after the establishment of 
equilibrium 


Vopr = NapkT (3.6.26a’) 
with Vi + Vj = V, it follows that 
VP! =(Na+N,)kT , (3.6.26b’) 


i.e. P’ = P. Incidentally, in (3.6.24) and (3.6.26b’), the fact is used that the 
two monatomic gases have the same specific heat. Comparing (3.6.26b) and 
(3.6.26b’), we find the volume ratios 
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From this we obtain 


5 TNatNo 
! = 
Ss 8 = 98108 CNapm ; 
which finally yields 
5 T 5 Cala + cyly 


S-S= aN log = 5 KN log (3.6.27) 


Cao Cab 
Ty T, ai T, 


Due to the convexity of the exponential function, we have 


TT; = exp(ca log Ta + cy log Th) < ca exp log T, + cy exp log T, 
= Cala + col =f ; 


and thus it follows from (3.6.27) that S’ — S > 0, i.e. the entropy increases 
on thermal equilibration. 


Note: 


Following the equalization of temperatures, in which heat flows from the warmer 
to the cooler parts of the system, the volumes are given by: 


Na No 


a PING = ieee 


Ve Me: Ws 
Together with Eq. (3.6.26b), this gives Vi/V. = T/T. and Vj/V, = T/T. The 
energy which is put into subsystem a is AF, = 3Nak(T — Ta). 

The enthalpy increase in subsystem a is given by AH, = 3 Nak(T — T,). Since the 
process is isobaric, we have AQ, = AH,. The work performed on subsystem a is 
therefore equal to 


AW, = AFa — AQa = —Nak(T — Ta) - 
The warmer subsystem gives up heat. Since it would then be too rarefied for the 


pressure P, it will be compressed, i.e. it takes on energy through the work performed 
in this compression. 


3.6.3.4 Entropy of Mixing 

We now consider the process of mixing of two different ideal gases with the 

masses M, and mp. 

The temperatures and pressures of the gases are taken to be the same, 
T,2=T.=T , Pi, =Rh=P. 


From the equations of state, 


VP = NakT , VoP = NokT 
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Fig. 3.16. The mixing of two gases 


it follows that 


Na No _ Na Al No 


Vi Vo Va + Vo 


Sar? the entropy when the gases 


Using the thermal wavelength Aq,» = eee 


are separated by a partition is given by 


5 Va 5 Vs 
= Go +5, =i Na(>+1 BM 244 . (3.6.28 
ane { € 8x) aC ux) f oon 


After removal of the partition and mixing of the gases, the value of the entropy 


is 

5 Va + Vo 5 Va + Vo 

S'= k{N, ( + log ) + Np ( + log )} ‘ (3.6.29) 
2 N, x3 2 N, 2 


From Egns. (3.6.28) and (3.6.29), we obtain the difference in the entropies: 


(Na + Np) NetNo 
NaN 


1 
S’—S=klog = k(Na + Np) log (=<) >0, 
a ~b 


where we have used the relative particle numbers 


Na,b 


Ca,b = N+ Ns . 


Since the argument of the logarithm is greater than 1, we find that the entropy 
of mixing is positive, 


eg. Na=N,, S’— S$ =2kN, log2. 
The entropy of mixing always occurs when different gases interdiffuse, even 


when they consist of different isotopes of the same element. When, in con- 
trast, the gases a and 0 are identical, the value of the entropy on removing 
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the partition is 


Sty = k(Na + Ns) {5 


Vat Vi 
5 t log ame. \ 


6.29) 
(Na + No)A3 ee 


and \ = Ag = Ay. We then have 


Na nrN 
‘ag —-5 = klog aE ae =0 
: (Nak Ne) Nethe vines 


making use of the equation of state; therefore, no entropy of mixing occurs. 
This is due to the factor 1/N! in the basic phase-space volume element in 
Eqns. (2.2.2) and (2.2.3), which results from the indistinguishability of the 
particles. Without this factor, Gibbs’ paradox would occur, i.e. we would 
find a positive entropy of mixing for identical gases, as mentioned following 
Eq. (2.2.3). 


*3.6.3.5 Heating a Room 


Finally, we consider an example, based on one given by Sommerfeld.’ A room is 
to be heated from 0°C to 20°C. What quantity of heat is required? How does the 
energy content of the room change in the process? 

If air can leave the room through leaks around the windows, for example, then 
the process is isobaric, but the number of air molecules in the room will decrease 
in the course of the heating process. The quantity of heat required depends on the 
increase in temperature through the relation 


6Q = CpdT , (3.6.30) 


where C'p is the heat capacity at constant pressure. In the temperature range that 
we are considering, the rotational degrees of freedom of oxygen, Oz, and nitrogen, 
No, are excited (see Chap. 5), so that under the assumption that air is an ideal gas, 
we have 


One INE (3.6.31) 
where JN is the overall number of particles. 


The total amount of heat required is found by integrating (3.6.31) between the 
initial and final temperatures, 71 and T»: 


T2 
Q= far Cp. (3.6.32) 
Ty 


If we initially neglect the temperature dependence of the particle number, and thus 
the heat capacity (3.6.31), we find 


CL 0a Ti) = EMiK(T2 Ti) . (3.6.32') 


” A. Sommerfeld, Thermodynamics and Statistical Mechanics: Lectures on Theo- 
retical Physics, Vol. V, (Academic Press, New York, 1956) 


118 3. Thermodynamics 


Here, we have denoted the particle number at 7 as Ni and taken it to be constant. 
Equation (3.6.32’) will be a good approximation, as long as T> © T}. 

If we wish to take into account the variation of the particle number within the 
room (volume V), we have to replace N in Eq. (3.6.31) by N from the equation of 
state, N = PV/kT, and it follows that 


CPF Ti % a 
= | a = +PV} SN RT log == 4 6. 
Q ‘: pr = PV loge = Nhl log (3.6.33) 


With log 7 = 7 14 oR 1)’), we obtain from (3.6.33) for small temperature 


differences the approximate formula (3.6.32’) 


oa T2- Th 6 dyn 6 3 20 _ 3.5 x 2 11 3 
Q gPV Tr 3.5 (10 al 10°(Vm Jang = 979 10°-erg (Vm”") 


= 6 kcal (Vm*). 
It is instructive to compute the change in the energy content of the room on heating, 
taking into account the fact that the rotational degrees of freedom are fully excited, 


T > O, (see Chap. 5). Then the internal energy before and after the heating 
procedure is 


Ei, = 


N; kT; NikO.e + Ni€el 


NO] ou wl] or 


fy —- Ey = 


1 1 1 “yt.” a 
k(NoT» — NiTi) — gPV®, (z zx) Pepys (x zx) 


(3.6.34) 


The first term is exactly zero, and the second one is positive; the third, dominant 
term is negative. The internal energy of the room actually decreases upon heating. 
The heat input is given up to the outside world, in order to increase the temperature 
in the room and thus the average kinetic energy of the remaining gas molecules. 

Heating with a fixed particle number (a hermetically sealed room) requires 
a quantity of heat Q = Cv(T2—-7T1) = 2 Ni k(T2 — T,). For small temperature 
differences T2 — Ti, it is then more favorable first to heat the room to the final 
temperature 77> and then to allow the pressure to decrease. The point of intersection 
of the two curves (P, N) constant and P constant, with N variable (Fig. 3.17) at 
TS is determined by 


T3—Ti _ Cp 

= “ee 

T; log re Vv 
A numerical estimate yields T? = 1.97, for the point of intersection in Fig. 3.17, 
ie. at T, = 273K, T? = 519K. 


For any process of space heating, isolated heating is more favorable. The differ- 
ence in the quantities of heat required is 


AQ & (Cp — Cv)(T2 — Th) = 358 kcal (Vm*) = 1.7 kcal (Vm*) . 
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P const. 
N variable 


Fig. 3.17. The quantity of heat required 
for space heating: as an isobaric process 
(solid curve), isochore (dashed curve), or 

T isobaric neglecting the decrease in particle 
number (dot-dashed curve). 


All of the above considerations have neglected the heat capacity of the walls. 
They are applicable to a rapid heating of the air. 
The change in pressure on heating a fixed amount of air by 20°C is, however, 


6p _ or 20 


Rose AS i ~ ~ 2m 2 
P T ~ 373 0.07 , ie. bP ~ 0.07 bar ~ 0.07 kg/cm* ~ 700 kg/m* ! 


*3.6.3.6 The Irreversible, Quasistatic Gay-Lussac Experiment 


We recall the different versions of the Gay-Lussac experiment. In the irreversible 
form, we have AQ = 0 and AS > 0 (3.5.1). In the reversible case (isothermal 
or adiabatic), using (3.5.9) and (3.5.14), the corresponding relation for reversible 
processes is fulfilled. 

It is instructive to carry out the Gay-Lussac experiment in a quasistatic, irre- 
versible fashion. One can imagine that the expansion does not take place suddenly, 
but instead is slowed by friction of the piston to the point that the gas always 
remains in equilibrium. The frictional heat can then either be returned to the gas 
or given up to the environment. We begin by treating the first possibility. Since 
the frictional heat from the piston is returned to the gas, there is no change in 
the environment after each step in the process. The final result corresponds to the 
situation of the usual Gay-Lussac experiment. For the moment, we denote the gas 
by an index 1 and the piston, which initially takes up the frictional heat, by 2. Then 
the work which the gas performs on expansion by the volume change dV is given 
by 


dbWi2 = Pav . 
This quantity of energy is passed by the piston to 1: 
6Q21 = OWi2 . 


The energy change of the gas is dE = 6Q2—1 — 6bWi—2 = 0. Since the gas is always 
in equilibrium at each instant, the relation dE = TdS — PdvV also holds and thus 
we have for the entropy increase of the gas: 


TdS = 6Q21 > 0. 


The overall system of gas + piston transfers no heat to the environment and also 
performs no work on the environment, i.e. 6Q = 0 and dW = 0. Since the entropy 
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of the piston remains the same (for simplicity, we consider an ideal gas, whose 
temperature does not change), it follows that TdS > 6Q. 

Now we consider the situation that the frictional heat is passed to the outside 
world. This means that 6Q21 = 0 and thus TdS = 0, also dS = 0. The total 
amount of heat given off to the environment (heat loss 6Q1) is 


6Qr =dWi-2 > 0. 


Here, again, the inequality —dQz, < TdS is fulfilled, characteristic of the irreversible 
process. The final state of the gas corresponds to that found for the reversible 
adiabatic process. There, we found AS = 0, Q = 0, and W > 0. Now, AS = 0, 
while Qx > 0 and is equal to the W of the adiabatic, reversible process, from 
Eq. (3.5.12). 


3.6.4 Extremal Properties 


In this section, we derive the extremal properties of the thermodynamic po- 
tentials. From these, we shall obtain the equilibrium conditions for multi- 
component systems in various phases and then again the inequalities (3.3.5) 
and (3.3.6). 

We assume in this section that no particle exchange with the environment 
occurs, i.e. dN; = 0, apart from chemical reactions within the system. Con- 
sider the system in general not yet to be in equilibrium; then for example in 
an isolated system, the state is not characterized solely by E, V, and N; , but 
instead we need additional quantities x,, which give e.g. the concentrations of 
the independent components in the different phases or the concentrations of 
the components between which chemical reactions occur. Another situation 
not in equilibrium is that of spatial inhomogeneities.® 

We now however assume that equilibrium with respect to the temperature 
and pressure is present, i.e. that the system is characterized by uniform (but 
variable) T and P values. This assumption may be relaxed somewhat. For 
the following derivation, it suffices that the system likewise be at the pressure 
P at the stage when work is being performed by the pressure P, and when 
it is exchanging heat with a reservoir at the temperature T, that it be at the 
temperature T. (This permits e.g. inhomogeneous temperature distributions 
during a chemical reaction in a subsystem.) Under these conditions, the First 
Law, Eq (3.6.9), is given by dE = 6Q — PdV. 


8 As an example, one could imagine a piece of ice and a solution of salt in water at 
P =1 atm and —5°C. Each component of this system is in equilibrium within 
itself. If one brings them into contact, then a certain amount of the ice will melt 
and some of the NaCl will diffuse into the ice until the concentrations are such 
that the ice and the solution are in equilibrium (see the section on eutectics). The 
initial state described here — a non-equilibrium state — is a typical example of 
an inhibited equilibrium. As long as barriers impede (inhibit) particle exchange, 
i.e. so long as only energy and volume changes are possible, this inhomogeneous 
state can be described in terms of equilibrium thermodynamics. 
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Our starting point is the Second Law, (3.6.10): 


dQ 
dS>. (3.6.35) 


We insert the First Law into this equation and obtain 
1 
dS > dE + PdV). (3.6.36a) 


We have used the principle of energy conservation from equilibrium ther- 
modynamics here, which however also holds in non-equilibrium states. The 
change in the energy is equal to the heat transferred plus the work performed. 
The precondition is that during the process a particular, well-defined pressure 
is present. 

If FE, V are held constant, then according to Eq. (3.6.36a), we have 


dS >0 for EF, V fixed ; (3.6.36b) 


that is, an isolated system tends towards a maximum of the entropy. When a 
non-equilibrium state is characterized by a parameter 2, its entropy has the 
form indicated in Fig. 3.18. It is maximal for the equilibrium value xp. The 
parameter x could be e.g. the volume of the energy of a subsystem of the 
isolated system considered. 

One refers to a process or variation as virtual— that is, possible in principle 
— if it is permitted by the conditions of a system. An inhomogeneous distri- 
bution of the energies of the subsystems with constant total energy would, 
to be sure, not occur spontaneously, but it is possible. In equilibrium, the 
entropy is maximal with respect to all virtual processes. 

We now consider the free enthalpy or Gibbs’ free energy, 


G=E-TS+PV, (3.6.37) 


which we define with Eq. (3.6.37) for non-equilibrium states just as for equi- 
librium states. For the changes in such states, we find from (3.6.36a) that the 
inequality 


dG < —SdT + VdP (3.6.38) 


holds. For the case that T and P are held constant, it follows from (3.6.38a) 
that 


dG<0 for T and P fixed, (3.6.38) 


i.e. the Gibbs’ free energy G tends towards a minimum. In the neighborhood 
of the minimum (Fig. 3.19), we have for a virtual (in thought only) variation 


C26 te ecan= 50" (00) (60) (3.6.39) 
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Fig. 3.18. The entropy as a function Fig. 3.19. The free enthalpy as a 
of a parameter x, with the equilibrium function of a parameter. 


value xo. 


The first-order terms vanish, therefore in first order we find for 6: 


6G=0 for T and P fixed ® (3.6.38¢) 


One terms this condition stationarity. Since G is minimal at xo, we find 
G(x) > 0. (3.6.40) 


Analogously, one can show for the free energy (Helmholtz free energy) F = 
FE —TS and for the enthalpy H = E+ PV that: 


dF < —SdT — PdV (3.6.41a) 
and 
dH <TdS+VdP. (3.6.42a) 


These potentials also tend towards minimum values at equilibrium under the 
condition that their natural variables are held constant: 


dF <0 for T and V fixed (3.6.41b) 
and 

dH <0 for S and P fixed . (3.6.42b) 
As conditions for equilibrium, it then follows that 

oF =0 for T and V fixed (3.6.41c) 
and 


dH =0 for S and P fixed . (3.6.42c) 


° This condition plays an important role in physical chemistry, since in chemical 
processes, the pressure and the temperature are usually fixed. 
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*3.6.5 Thermodynamic Inequalities Derived from Maximization of 
the Entropy 


We consider a system whose energy is F and whose volume is V. We decom- 
pose this system into two equal parts and investigate a virtual change of the 
energy and the volume of subsystem 1 by dF; and dV,. Correspondingly, the 
values for subsystem 2 change by —dF, and —6V\. The overall entropy before 
the change is 


S(E,V) = Si (5.5) es (5-5) (3.6.43) 


Therefore, the change of the entropy is given by 


6S = 54 (5 6B, m) (5 ree m) S(E,V) 


eee 


dE, aE, OV, OV, 
1 (PS, , 82 >, 1 (OS, , 2S, ; 
D (FR | rat) (551) 5 (FF | ar) (Mi) 
8S, 82S, 
i: So) SEWN + 


(3.6.44) 


From the stationarity of the entropy, 6S = 0, it follows that the terms which 
are linear in 6£, and 6V, must vanish. This means that in equilibrium the 
temperature TJ and the pressure P of the subsystems must be equal 


Ty = T> 4 P, = Py, ; (3.6.45a) 


this is a result that is already familiar to us from equilibrium statistics. 

If we permit also virtual variations of the particle numbers, dN; and 
—dNj,, in the subsystems 1 and 2, then an additional term enters the second 
line of (3.6.44): c - oe) 6.Nj; and one obtains as an additional condition 


for equilibrium the equality of the chemical potentials: 


Here, the two subsystems could also consist of different phases (e.g. solid and 
liquid). 

We note that the second derivatives of 5; and S2 in (3.6.44) are both 
to be taken at the values E£/2, V/2 and they are therefore equal. In the 
equilibrium state, the entropy is maximal, according to (3.6.36b). From this 
it follows that the coefficients of the quadratic form (3.6.44) obey the two 
conditions 


124 3. Thermodynamics 


OS,  0?So 
ee eee 6. 
SEF = DEF <° (3.6.46a) 


and 


2 2 2 2 
sae ea ( oo ) >0. (3.6.46b) 


JE? OV2 \dE\OV 


We now leave off the index 1 and rearrange the left side of the first condition: 


os OF 1 
OE? (53) ge Ge eee) 


The left side of the second condition, Eq. (3.6.46b), can be represented by a 
Jacobian, and after rearrangement, 


05m 3v) _ O(n 7) _ Az 


O(E,V) ——O(E,V) A(T 


7) O(T,V) 
JV 


eet) (3.6.47b) 


SAS POR A. A ge fo ol 

7 T3 OV T Cy 7 T3V krCy ; 
If we insert the expressions (3.6.47a,b) into the inequalities (3.6.46a) and 
(3.6.46b), we obtain 


Cy>0, #r>0, (3.6.48a,b) 


which expresses the stability of the system. When heat is given up, the system 
becomes cooler. On compression, the pressure increases. 

Stability conditions of the type of (3.6.48a,b) are expressions of Le Chate- 
lier’s principle: When a system is in a stable equilibrium state, every sponta- 
neous change in its parameter leads to reactions which drive the system back 
towards equilibrium. 

The inequalities (3.6.48a,b) were already derived in Sect. 3.3 on the basis 
of the positivity of the mean square deviations of the particle number and 
the energy. The preceding derivation relates them within thermodynamics to 
the stationarity of the entropy. The inequality Cy > 0 guarantees thermal 
stability. If heat is transferred to part of a system, then its temperature 
increases and it releases heat to its surroundings, thus again decreasing its 
temperature. If its specific heat were negative, then the temperature of the 
subsystem would decrease on input of heat, and more heat would flow in from 
its surroundings, leading to a further temperature decrease. The least input of 
heat would set off an instability. The inequality «7 > 0 guarantees mechanical 
stability. A small expansion of the volume of a region results in a decrease in 
its pressure, so that the surroundings, at higher pressure, compress the region 
again. If however «7 < 0, then the pressure would increase in the region and 
the volume element would continue to expand. 
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3.7 Cyclic Processes 


The analysis of cyclic processes played an important role in the historical 
development of thermodynamics and in the discovery of the Second Law of 
thermodynamics. Even today, their understanding is interesting in principle 
and in addition, it has eminent practical significance. Thermodynamics makes 
statements concerning the efficiency of cyclic processes (periodically repeating 
processes) of the most general kind, which are of importance both for heat 
engines and thus for the energy economy, as well as for the energy balance of 
biological systems. 


3.7.1 General Considerations 


In cyclic processes, the working substance, i.e. the system, returns at intervals 
to its initial state (after each cycle). For practical reasons, in the steam engine, 
and in the internal combustion engine, the working substance is replenished 
after each cycle. We assume that the process takes place quasistatically; thus, 
we can characterize the state of the system by two thermodynamic variables, 
e.g. P and V or T and S. The process can be represented as a closed curve 
in the P-V or the T-S plane (Fig. 3.20). 


Fig. 3.20. A cyclic process: 
V S (a) in the P-V diagram; (b) in 
a) b) the T-S diagram 


The work which is performed during one cycle is given by the line integral 
along the closed curve 


W=-W= $ PV =A, (3.7.1) 


which is equal to the enclosed area A within the curve representing the cyclic 
process in the P-V diagram. 
The heat taken up during one cycle is given by 


Q= fas =A. (3.7.2) 


Since the system returns to its initial state after a cycle, thus in particular 
the internal energy of the working substance is unchanged, it follows from 
the principle of conservation of energy that 


Q=w. (3.7.3) 
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The heat taken up is equal to the work performed on the surroundings. The 
direction of the cyclic path and the area in the P-V and T-S' diagrams are 
thus the same. When the cyclic process runs in a clockwise direction (right- 
handed process), then 


OY Q=W>0 (3.7.4a) 


and one refers to a work engine. In the case that the process runs counter- 
clockwise (left-handed process), we have 


O Q=W<0 (3.7.4b) 


and the machine acts as a heat pump or a refrigerator. 


3.7.2 The Carnot Cycle 


The Carnot cycle is of fundamental importance; its P-V and T-S diagrams 
are shown in Fig. 3.21. 

We initially discuss the process which runs clockwise, i.e. the work en- 
gine. The starting point is point A in the diagram. The cycle is divided into 
four operations: an isothermal expansion, an adiabatic expansion, an isother- 
mal compression, and an adiabatic compression. The system is alternately 
connected to heat baths at temperatures T> and 7T,, where 7) > 7), and in 
between it is insulated. The motion of the piston is shown in Fig. 3.22. 


Fig. 3.21. A Carnot 
cycle in (a) the P-V di- 
agram and (b) the T-S' 
diagram 


Fig. 3.22. The se- 


@ eee ® HHH quence of the Carnot 


cycle 
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1. Isothermal expansion: the system is brought into contact with the warmer 
heat bath at the temperature T>. The quantity of heat 


Q2 = T2(S2 — $1) (3.7.5a) 


is taken up from the bath, while at the same time, work is performed on the 
surroundings. 

2. Adiabatic expansion: the system is thermally insulated. Through an 
adiabatic expansion, work is performed on the outer world and the working 
substance cools from T> to the temperature T}. 

3. Isothermal compression: the working substance is brought into thermal 
contact with the heat bath at temperature JT; and through work performed 
on it by the surroundings, it is compressed. The quantity of heat “taken up” 
by the working substance 


Qi =T1(S1 — S2) <0 (3.7.5b) 


is negative. That is, the quantity |Qi| of heat is given up to the heat bath. 
4. Adiabatic compression: employing work performed by the outside 
world, the now once again thermally insulated working substance is com- 
pressed and its temperature is thereby increased to T). 
After each cycle, the internal energy remains the same; therefore, the total 
work performed on the surroundings is equal to the quantity of heat taken 
up by the system, Q = Q) + Qo; thus 


The thermal efficiency (= work performed/heat taken up from the warmer 
heat bath) is defined as 


Ww 


=—_., 3.7.6a 
=o, ( ) 
For the Carnot machine, we obtain 
Ti 
ee 3.7.6b 
NC Th’ ( ) 


where the index C stands for Carnot. We see that nc < 1. The general 
validity of (3.7.6a) cannot be too strongly emphasized; it holds for any kind 
of working substance. Later, we shall show that there is no cyclic process 
whose efficiency is greater than that of the Carnot cycle. 


The Inverse Carnot Cycle 

Now, we consider the inverse Carnot cycle, in which the direction of the 
operations is counter-clockwise (Fig. 3.23). In this case, for the quantities of 
heat taken up from baths 2 and 1, we find 
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Fig. 3.23. The inverse Carnot cycle 


Q2 = T2(S1 — S2) <0 


3.7.7a,b 
Qi = T(S2 — $1) > 0. ( a ) 


The overall quantity of heat taken up by the system, Q, and the work per- 
formed on the system, W, are then given by 


Q = (T, — T2)(S2 — $1) =-W <0. (3.7.8) 


Work is performed by the outside world on the system. The warmer reservoir 
is heated further, and the cooler one is cooled. Depending on whether the 
purpose of the machine is to heat the warmer reservoir or to cool the colder 
one, one defines the heating efficiency or the cooling efficiency. 

The heating efficiency (= the heat transferred to bath 2/work performed) is 


H  —Q2 T 
oS Vi a 


oa (3.7.9) 


Since n# > 1, this represents a more efficient method of heating than the 
direct conversion of electrical energy or other source of work into heat (this 
type of machine is called a heat pump). The formula however also shows 
that the use of heat pumps is reasonable only as long as T> ~ T; when the 
temperature of the heat bath (e.g. the Arctic Ocean) T; < Ta, it follows that 
|Q2| = |W, i.e. it would be just as effective to convert the work directly into 
heat. 

The cooling efficiency (= the quantity of heat removed from the cooler reser- 
voir /work performed) is 


Q_ 7 
W T> _ T) : 
For large-scale technical cooling applications, it is expedient to carry out the 
cooling process in several steps, i.e. as a cascade. 


Je 
Cc 


n (3.7.10) 


3.7.3 General Cyclic Processes 


We now take up a general cyclic process (Fig. 3.24), in which heat exchange 
with the surroundings can take place at different temperatures, not neces- 
sarily only at the maximum and minimum temperature. We shall show, that 
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S V 


Fig. 3.24. The general cyclic process Fig. 3.25. The idealized (full curve) 
and real (dashed curve) sequence of 
the Carnot cycle 


the efficiency 7 obeys the inequality 
NS 1c, (3.7.11) 


where 7c the efficiency of a Carnot cycle operating between the two extreme 
temperatures. 

We decompose the process into sections with heat uptake (6Q > 0) and 
heat output (dQ < 0), and also allow irreversible processes to take place 


w=a=f59= / 5Q + 5Q= 2+. 


5Q>0 5Q<0 >0 <0 


It follows from the Second Law that 


Qf 6, f @~ a 
p . me 5 ee re Bon ae 


Here, for the second inequality sign, we have used the inequality T; < T < T5. 
We thus obtain 


Qi Ti 
se ee 3.7.13 
Qo” Th ( ) 
From this, we find for the efficiency of this process the inequality 
Qi + Q2 Qi Ti 
= -14 7 = ‘ 3.7.14 
” Os 02 7, = "Ne ( ) 


whereby (3.7.11) is proven. The efficiency 7 is only then equal to that of 
the Carnot cycle if the heat transfer occurs only at the minimum and the 
maximum temperatures and if the process is carried out reversibly (the second 
and first inequality signs in Eq. (3.7.12). 

In the case of the real Carnot machine, also, there must be a small dif- 
ference between the internal and the external pressure, in order to cause the 
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process to take place at all (see Fig. 3.25). We recall the considerations at the 
end of Sect. 3.5, which referred to Fig. 3.9. This leads to the result that W is 
given by the area enclosed by the dashed curve. Therefore, the efficiency of 
the real Carnot machine is somewhat less than the maximum value given by 
(3.7.6b). Physics sets a universal limit here to the efficiency of industrially 
applicable heat engines, but also to that of biological systems. 


3.8 Phases of Single-Component Systems 


The different chemical substances within a system are called components. In 
the case of a single chemical substance, in contrast, one refers to a single- 
component system or a pure system. The components of a system can occur in 
different physical forms (structures), which are termed phases. In this section, 
we consider single-component systems. 


3.8.1 Phase-Boundary Curves 


Every substance can occur in several different phases: solid, liquid, gaseous. 
The solid and the liquid phases can further split into other phases with dif- 
fering physical properties. Under which conditions can two phases occur in 
equilibrium with each other? The condition for equilibrium, (2.7.4), or also 
(3.6.45a,b) states that T, P and yw must be equal. Let wi(T, P) and po(T, P, ) 
be the chemical potentials of the first and the second phase; then we have 


p(T, P) = po(T, P). (3.8.1) 
From this, we obtain the phase boundary curve 
P=PT). (3.8.2) 


The coexistence of two phases is possible along a curve in the P-T diagram. 
Examples of phase boundaries are (see Fig 3.26): solid—liquid: the melting 
curve; solid—gaseous: the sublimation curve; liquid—gaseous: the vapor pres- 
sure curve; also called the evaporation curve. Fig. 3.26 shows a phase diagram 
which is typical of most simple substances. 

We first consider the process of evaporation on isobaric heating of the 
liquid, e.g. at the pressure Po in Fig. 3.27a. In the region 1, only the liquid 
is present; at a temperature T(P5) (point 2), the liquid evaporates, and in 
region 3, the substance is present in the gas phase. 

For a complete characterization of the physical situation on the transition 
line, we represent the evaporation process at constant pressure Pp in the T-V 
diagram (Fig. 3.27b). In region 1, only liquid is present, and an input of heat 
leads to an increase of the temperature and thermal expansion, until T'(Pp) is 
reached. Further input of heat then goes into the conversion of liquid into gas 
(region 2). Only when all of the liquid has evaporated does the temperature 


3.8 Phases of Single-Component Systems 131 


liquid 
solid 


evaporation 


sublimation 


gaseous 


Fig. 3.26. The phases of a simple sub- 
stance in the P-T diagram 


liquid & 
gas 


(a) (b) 


Fig. 3.27. The evaporation process: (a) P-T diagram: the vapor-pressure curve; 
(b) T-V diagram: coexistence region, bounded by the coexistence curve (solid line), 
isobaric heating (dashed or dot-dashed) 


once again increase (region 3). In the horizontal part of isobar 2, gas and 
liquid are present as the fractions cq and cz, 


catcp=1. (3.8.3) 
The overall volume is 
V =ceVet+ aM, =ceVe + (l-ca)M, (3.8.4) 


where Ve and Vy, are the volumes of the pure gas and liquid phases at the 
evaporation temperature. It follows that 


V-VU 


Se 3.8.5 
= Ve (3.8.5) 


CG 
If we consider the evaporation process at a different pressure, we find a simi- 
lar behavior. The region of horizontal isobars is called the coexistence region, 
since here the liquid coexists with the gas. This region is bounded by the 
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coexistence curve. When the pressure is increased, the difference between 
the liquid and the gas phase becomes less and the coexistence region nar- 
rows in the T-V diagram. The two branches of the coexistence curve join 
at the critical point, whose temperature and pressure T, and P, are called 
the critical temperature and critical pressure. For water, T. = 647.3 K and 
P. = 221.36 bar. The critical temperatures of some other substances are col- 
lected in Table I.4. At pressures above the critical pressure, there is no phase 
transition between a more dense liquid and a less dense gas phase. In this 
range, there is only a fluid phase which varies continuously with tempera- 
ture. These facts are more clearly represented in a three-dimensional P-V-T 
diagram. 

At temperatures below T., the liquid phase can be reached by isothermal 
compression. At temperatures above T,, there is no phase transition from the 
gaseous to the liquid phase. This fact was first demonstrated with substances 
which are gaseous under normal conditions, Og, No, ..., in gas compression 
experiments at extremely high pressures by Natterer!® (cf. the values of T, in 
Table I.4). The critical state was first investigated by Andrews!! using COg. 

In Fig. 3.28, the three-dimensional P-V-T diagram for a typical simple 
substance like CO2 is drawn. The surface defined by P = P(V,T) is called 
the surface of the equation of state or the PVT surface. The regions of co- 
existence of liquid-gas, solid-liquid, and solid-gas are clearly recognizable. 
This substance contracts upon solidifying. In Fig. 3.28, the projections on 
the P-T plane (i.e. the phase diagram) and on the P-V plane are also shown. 
The numerical values for CO2 are given in the P-T diagram. 

The situation already discussed for the liquid-gas transition is analogous 
in the cases of sublimation and melting; however, for these phase transitions 
there is no critical point. As can be seen from Figs. 3.26 and 3.28b, at the 
triple point, the solid, liquid, and gas phases coexist. In Fig. 3.28a,c, where 
the surface of the equation of state is plotted with respect to the extensive 
variable V, the triple point becomes a triple line (see Sect. 3.8.4). 

In Fig. 3.29, the phase diagram is also shown for a case in which the 
substance expands upon solidifying, as is the case for water. 


Notes: 


a) It is usual to denote a gas in the neighborhood of the vapor-pressure curve as 
vapor. Vapors are simply gases which deviate noticeably from the state of an ideal 
gas, except at very low pressures. A vapor which is in equilibrium with its liquid is 
termed ‘saturated’. 

b) In the technical literature, the vapor-pressure curve is also called the evap- 
oration curve, and the coexistence region is called the saturation region, while the 
coexistence curve is called the saturation curve. On evaporation of a liquid, there 
are also droplets of liquid floating in the vapor within the saturation region. These 


10 T. Natterer, Sitzungsberichte der kaiserlichen Akademie der Wissenschaften, 
mathem.-naturwiss. Classe, Vol. V, 351 (1850) and ibid., Vol. VI, 557 (1851); 
and Sitzungsbericht der Wien. Akad. XII, 199 (1854) 

" Th. Andrews, Philos. Trans. 159, 11, 575 (1869) 
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Fig. 3.28. COz: (a) The surface of the equation of state P = P(V,T) of a substance 

which contracts on freezing. The isotherms are shown as solid curves and the isobars 

are dashed. (b) The P-T diagram (phase diagram). Here, the numerical values for 

COz are given; the drawing is however not to scale. (c) The P-V diagram. 


droplets are called wet vapor. This ‘moist vapor’ vanishes only when the water 
droplets evaporate, leaving a dry saturated vapor (Vg in Fig. 3.27b). The expres- 
sion “saturated” is due to the fact that the least cooling of the vapor leads to the 
formation of water droplets, i.e. the vapor begins to condense. Vapor (gas) in the 
pure gas phase (region 3 in Fig. 3.27) is also termed ‘superheated vapor’. It is in- 
correct to call clouds of floating solid or liquid particles ‘vapor’ or ‘steam’ (e.g. a 
“steaming” locomotive). Such clouds are correctly called fog or condensation clouds. 
Water vapor is invisible. 

c) The right-hand branch of the coexistence curve (see Fig. 3.27b) is also called 
the condensation boundary and the left-hand branch the boiling boundary. Coming 
from the gas phase, the first liquid droplets form at the condensation boundary, and 
coming from the liquid phase, the first gas bubbles form at the boiling boundary. 

d) To elucidate the concept of vapor pressure, we consider the following demon- 
stration: take a cylindrical vessel containing a liquid, e.g. water, in its lower section. 
A movable, airtight piston is initially held directly above the water surface. If the 
piston is raised, keeping the temperature constant, then just enough water will 
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Fig. 3.29. H2O: (a) The surface of the equation of state of a substance which 
expands on freezing. (b) The P-T diagram (phase diagram). Here, the numerical 
values for HzO are shown. The diagram is however not drawn to scale. (c) The 
P-V diagram. 
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evaporate to produce a certain pressure in the free space which opens above the 
water surface, independent of the volume of this space. The vapor is saturated 
within this space. If the piston is again lowered, the vapor is not compressed, but 
rather just enough of it condenses into the liquid phase to keep the vapor phase 
saturated;cf. the isotherm in Fig. 3.29a. 

e) The pressure (more precisely, the partial pressure; see p. 155) of the saturated 
vapor above its liquid is nearly independent of whether other, different gases are 
present above the liquid, e.g. air. Evaporation in this situation will be treated in 
more detail later in Sect. 3.9.4.1. 


3.8.2 The Clausius—Clapeyron Equation 
3.8.2.1 Derivation 


According to the discussion of the preceding section, in general the volume 
and the entropy of the substance change upon passing through a phase bound- 
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ary curve. The Clausius—Clapeyron equation gives a relation between this 
change and the slope of the phase-boundary curve. These quantities are re- 
lated to each other because the equality of the chemical potentials (3.8.1) 
also implies the equality of the derivatives of the chemical potentials along 
the phase boundary curve, and the latter can be expressed in terms of the 
(specific) volumes and entropies. 

In order to derive the Clausius—Clapeyron equation, we insert into the 
equilibrium condition (3.8.1) its solution, (3.8.2), i.e. the phase-boundary 
curve Po(T): 


ii(T, Po(T)) = 2(T, Po(T)) , 
and then take the derivative with respect to T, 
(3) | (33) dPo _ (32) | (3) dP (3.8.6) 
OT } p OP), aT OT } p OP), aT 
We recall the two thermodynamic relations dG = —SdT + VdP + wdN and 


G = p(T, P)N, which are valid within each of the two homogeneous phases, 
from which it follows that 


_ Ou — (On 
S=- (SE) : V= (55) (3.8.7) 


Applying this to the phases 1 and 2 with the chemical potentials ju; and fio, 
we obtain from (3.8.6) 


dPy AS 

T= av’ (3.8.8) 
where the entropy and volume changes 

AS = S2,-—S; and AV=V-V, (3.8.9a,b) 


have been defined. Here, S;,2 and Vj,2 are the entropies and volumes of the 
substance consisting of N molecules in the phases 1 and 2 along the boundary 
curve. AS and AV are the entropy and volume changes as a result of the 
phase transition of the whole substance. The Clausius—Clapeyron equation 
(3.8.8) expresses the slope of the phase-boundary curve in terms of the ratio 
of the entropy and volume changes in the phase transition. The latent heat 
Qz is the quantity of heat which is required to convert the substance from 
phase 1 to phase 2: 


QL =TAS. (3.8.10) 


Inserting this definition into (3.8.8), we obtain the Clausius—Clapeyron equa- 
tion in the following form: 
dPp Qn 
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Remarks: 


(i) Frequently, the right-hand side of the Clausius-Clapeyron equation, 
(3.8.8) or (3.8.11), is expressed in terms of the entropies (latent heats) and 
volumes of 1 g or 1 Mole of a substance. 

(ii) In the transition from the low-temperature phase (1) to the high- 
temperature phase (2), AV can be either positive or negative; however, it 
always holds that AS > 0. In this connection we recall the process of isobaric 
heating discussed in Sect. 3.8.1. In the coexistence region, the temperature 
T remains constant, since the heat put into the system is consumed by the 
phase transition. From (3.8.10), Q, =TAS > 0, it follows that AS > 0. This 
can also be read off Fig. 3.34b, whose general form results from the concavity 
of Gand ($4), =-S <0. 


3.8.2.2 Example Applications of the Clausius—Clapeyron 
Equation: 


We now wish to give some interesting examples of the application of the 
Clausius—Clapeyron equation. 


(i) Liquid — gaseous: since, according to the previous considerations, AS > 
0 and the specific volume of the gas is larger than that of the liquid, AV > 0, 
it follows that oh > 0, i.e. the boiling temperature increases with increasing 
pressure (Table 1.5 and Figs. 3.28(b) and 3.29(b)). 

Table I.6 contains the heats of vaporization of some substances at their boiling 
points under standard pressure, i.e. 760 Torr. Note the high value for water. 

(ii) Solid — liquid: in the transition to the high-temperature phase, we have 
always AS > 0. Usually, AV > 0; then it follows that £ > 0. In the 
case of water, AV < 0 and thus £ < 0. The fact that ice floats on water 
implies via the Clausius—Clapeyron equation that its melting point decreases 
on increasing the pressure (Fig. 3.29). 


Note: There are a few other substances which expand on melting, e.g. mercury 
and bismuth. The large volume increase of water on melting (9.1%) is related to 
the open structure of ice, containing voids (the bonding is due to the formation 
of hydrogen bonds between the oxygen atoms, cf. Fig. 3.30). Therefore, the liquid 
phase is more dense. Above 4°C above the melting point Tin, the density of water 
begins to decrease on cooling (water anomaly) since local ordering occurs already 
at temperatures above Tn. 

While as a rule a solid material sinks within its own liquid phase (melt), ice floats 
on water, in such a way that about 9/10 of the ice is under the surface of the water. 
This fact together with the density anomaly of water plays a very important role 
in Nature and is fundamental for the existence of life on the Earth. 

The volume change upon melting if ice is Vi, — Vs = (1.00 — 1.091) cm3/g = 
—0.091cm*g~'. The latent heat of melting per g is Q@ = 80cal/g = 80 x 
42.7 atmcm*/g. From this, it follows that the slope of the melting curve of ice 
near 0°C is 


dP 80 x 42.7 atm 
dT 273 x 0.091 K 


= —138atm/K . (3.8.12) 
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Fig. 3.30. The hexagonal structure of 
ice. The oxygen atoms are shown; they are 
connected to four neighbors via hydrogen 
bonds 


The melting curve as a function of the temperature is very steep. It requires a pres- 
sure increase of 138 atm to lower the melting temperature by 1 K. This “freezing- 
point depression”, small as it is, enters into a number of phenomena in daily life. 
If a piece of ice at somewhat below 0° C is placed under increased pressure, it at 
first begins to melt. The necessary heat of melting is taken from the ice itself, and 
it therefore cools to a somewhat lower temperature, so that the melting process is 
interrupted as long as no more heat enters the ice from its surroundings. This is the 
so-called regelation of ice (= the alternating melting and freezing of ice caused by 
changes in its temperature and pressure). Pressing together snow, which consists 
of ice crystals, to make a snowball causes the snow to melt to a small extent due 
to the increased pressure. When the pressure is released, it freezes again, and the 
snow crystals are glued together. The slickness of ice is essentially due to the fact 
that it melts at places where it is under pressure, so that between a sliding object 
and the surface of the ice there is a thin layer of liquid water, which acts like a 
lubricant, explaining e.g. the gliding motion of an ice skater. Part of the plastic- 
ity of glacial ice and its slow motion, like that of a viscous liquid, are also due to 
regelation of the ice. The lower portions of the glacier become movable as a result 
of the pressure from the weight of the ice above, but they freeze again when the 
pressure is released. 


(iii) He, liquid — solid: the phase diagram of ?He is shown schematically 
in Fig. 3.31. At low temperatures, there is an interval where the melting 
curve falls. In this region, in the transition from liquid to solid (see the ar- 
row in Fig. 3.31a), oe < 0; furthermore, it is found experimentally that the 
volume of the solid phase is smaller than that of the liquid (as is the usual 
case), AV < 0. We thus find from the Clausius-Clapeyron equation (3.8.8) 
AS > 0, as expected from the general considerations in Remark (ii). 


The Pomeranchuk effect: The fact that within the temperature interval 
mentioned above, the entropy increases on solidification is called the Pomer- 
anchuk effect. It is employed for the purpose of reaching low temperatures 
(see Fig. 3.31b). Compression (dashed line) of liquid ?He leads to its solidifi- 
cation and, because of AS > 0, to the uptake of heat. This causes a decrease 
in the temperature of the substance. Compression therefore causes the phase 
transition to proceed along the melting curve (see arrow in Fig. 3.31b). 
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Fig. 3.31. The phase diagram of *He. (a) Isobaric solidification in the range where 
4° <0. (b) Pomeranchuk effect 


This effect can be used to cool *He; with it, temperatures down to 2x 1073 K 
can be attained. The Pomeranchuk effect, however, has nearly no practical 
significance in low-temperature physics today.The currently most important 
methods for obtaining low temperatures are *He-*He dilution (2 x 1073 —5 x 
10~° K) and adiabatic demagnetization of copper nuclei (1.5 x 107° — 12 x 
10~° K), where the temperatures obtained are shown in parentheses. 


(iv) The sublimation curve: We consider a solid (1), which is in equilibrium 
with a classical, ideal gas (2). For the volumes of the two phases, we have 
Vi < Vo; then it follows from the Clausius—Clapeyron equation (3.8.11) that 


dP Qi 

dT TV’ 
where Qry, represents the latent heat of sublimation. For V2, we insert the 
ideal gas equation, 


dP Q.iP 
dl kNT2° 


(3.8.13) 


This differential equation can be immediately integrated under the assump- 
tion that Q; is independent of temperature: 


P=PoeV* ” (3.8.14) 


where q = SE is the heat of sublimation per particle. Equation (3.8.14) yields 
the shape of the sublimation curve under the assumptions used. 


The vapor pressure of most solid materials is rather small, and in fact in most 
cases, no observable decrease with time in the amount of these substances due to 
evaporation is detected. Only a very few solid materials exhibit a readily observable 
sublimation and have as a result a noticeable vapor pressure, which increases with 
increasing temperature; among them are some solid perfume substances. Numerical 
values for the vapor pressure over ice and iodine are given in Tables I.8 and [.9. 
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At temperatures well below 0° C and in dry air, one can observe a gradual disap- 
pearance of snow, which is converted directly into water vapor by sublimation. The 
reverse phenomenon is the direct formation of frost from water vapor in the air, 
or the condensation of snow crystals in the cool upper layers of the atmosphere. If 
iodine crystals are introduced into an evacuated glass vessel and a spot on the glass 
wall is cooled, then solid iodine condenses from the iodine vapor which forms in the 
vessel. Iodine crystals which are left standing in the open air, napthalene crystals 
(“moth balls”), and certain mercury salts, including “sublimate” (HgClz2), among 
others, gradually vanish due to sublimation. 


3.8.3 The Convexity of the Free Energy and the Concavity 
of the Free Enthalpy (Gibbs’ Free Energy) 


We now return again to the gas-liquid transition, in order to discuss some 
additional aspects of evaporation and the curvature of the thermodynamic 
potentials. The coexistence region and the coexistence curve are clearly vis- 
ible in the T-V diagram. Instead, one often uses a P-V diagram. From the 
projection of the three-dimensional P-V-T diagram, we can see the shape 
drawn in Fig. 3.32. From the shape of the isotherms in the P-V diagram, 
the free energy can be determined analytically and graphically. Owing to 
($4), = —P, it follows for the free energy that 


isotherm 
PAV) 


Fig. 3.32. The isotherms Pr(V) and the 
F-F,=F(T.V) -F(T, Ye) = - } dV" PV") free energy as a function of the volume dur- 
Vo ing evaporation; the thin line is the coexis- 

convex shape tence curve 
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Fig. 3.33. The determination of the 
free enthalpy from the free energy by 


G(T,P)=F(T,V)+V-P construction 
Vv 
F(T,V) — F(L£,Vo) = - / dV'Pr(V’). (3.8.15) 
Vo 


One immediately sees that the isotherms in Fig. 3.32 lead qualitatively to the 
volume dependence of the free energy which is drawn below. The free energy 
is convex (curved upwards). The fundamental cause of this is the fact that 
the compressibility is positive: 


OF OP 1 


AVE? OV Re 
while 
QR 
(22) __ (88) cy eo inig 
Vv Vv 


These inequalities are based upon the stability relations proved previously, 
(3.3.5, 3.3.6), and (3.6.48a,b). 

The free enthalpy or Gibbs’ free energy G(T, P) = F' + PV can be con- 
structed from F(T,V). Due to P = — (Se) G(T, P) is obtained from 
F(T,V) by constructing a tangent to F(T,V) with the slope —P (see 
Fig. 3.33). The intersection of this tangent with the ordinate has the co- 
ordinates 


OF 


F(T,V)-V (sr 


) =F+VP=G(T,P). (3.8.17) 
T 


The result of this construction is drawn in Fig. 3.34. 
The derivatives of the free enthalpy 


OG OG 
— J) =V and —) =-S 
(3), : (ar) P 
yield the volume and the entropy. They are discontinuous at a phase tran- 
sition, which results in a kink in the curves. Here, Po(T’) is the evaporation 
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Fig. 3.34. The free enthalpy (Gibbs’ free energy) as a function of (a) the pressure 
and (b) the temperature. 


pressure at the temperature T, and To(P) is the evaporation temperature at 
the pressure P. From this construction, one can also see that the free en- 
thalpy is concave (Fig. 3.34). The curvatures are negative because kr > 0 
and Cp > 0. The signs of the slopes result from V > 0 and S > 0. It is 
also readily seen from the figures that the entropy increases as a result of 
a transition to a higher-temperature phase, and the volume decreases as a 
result of a transition to a higher-pressure phase. These consequences of the 
stability conditions hold quite generally. In the diagrams (3.34a,b), the terms 
gas and liquid phases could be replaced by low-pressure and high-pressure or 
high-temperature and low-temperature phases. 

On melting, the latent heat must be added to the system, on freezing (solidify- 
ing), it must be removed. When heat is put into or taken out of a system at constant 
pressure, it is employed to convert the solid phase to the liquid or vice versa. In the 
coexistence region, the temperature remains constant during these processes. This 
is the reason why in late Autumn and early Spring the temperature near the Earth 
remains close to zero degrees Celsius, the freezing point of water. 


3.8.4 The Triple Point 


At the triple point (Figs. 3.26 and 3.35), the solid, liquid and gas phases 
coexist in equilibrium. The condition for equilibrium of the gaseous, liquid 
and solid phases, or more generally for three phases 1, 2 and 3, is: 


pat, P) zn p2(T, P) oa p3(T, P) ’ (3.8.18) 


and it determines the triple point pressure and the triple point temperature 
Pr, Ty. 

In the P-T diagram, the triple point is in fact a single point. In the 
T-V diagram it is represented by the horizontal line drawn in Fig. 3.35b. 
Along this line, the three phases are in equilibrium. If the phase diagram is 
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Fig. 3.35. The triple point (a) in a P-T diagram (the phases are denoted by 1, 2, 
3. The coexistence regions are marked as 3-2 etc., i.e. denoting the coexistence of 
phase 3 and phase 2 on the two branches of the coexistence curve.); (b) in a T-v 
diagram; and (c) in a v-s diagram 


represented in terms of two extensive variables, such as e.g. by V and S as 
in Fig. 3.35c, then the triple point becomes a triangular area as is visible in 
the figure. At each point on this triangle, the states of the three phases 1, 2, 
and 3 corresponding to the vertices of the triangle coexist with one another. 
We now want to describe this more precisely. Let s1, s2 and s3 be the 

entropies per particle in the phases 1, 2 and 3 just at the triple point, 
Opi 


$= —(3F % [Pee Py and correspondingly, v1, v2, v3 are the specific volumes 
= [ Obi : psa 7 : 
VWe= (3), lope The points (s;,v;) are shown in the s-v diagram as 


points 1, 2, 3. Clearly, every pair of phases can coexist with each other; 
the lines connecting the points 1 and 2 etc. yield the triangle with vertices 1, 
2, and 3. The coexistence curves of two phases, e.g. 1 and 2, are found in the 


s-u diagram from s,;(T) = — (3H) lpr) and v;(T) = (3H). lpr) with 
i = 1 and 2 along with the associated phase-boundary curve P = Po(T). 
Here, the temperature is a parameter; points on the two branches of the co- 
existence curves with the same value of T’ can coexist with each other. The 
diagram in 3.35c is only schematic. The (by no means parallel) lines within 
the two-phase coexistence areas show which of the pairs of single-component 
states can coexist with each other on the two branches of the coexistence 
line. 

Now we turn to the interior of the triangular area in Fig. 3.35c. It is 
immediately clear that the three triple-point phases 1, 2, 3 can coexist with 
each other at the temperature 7; and pressure P; in arbitrary quantities. This 
also means that a given amount of the substance can be distributed among 
these three phases in arbitrary fractions c1, c2, cs (0< Gj <1) 


Ctatcy=l, (3.8.19a) 
and then will have the total specific entropy 
C181 + C282 + ¢383 = § (3.8.19b) 
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and the total specific volume 
C1V1 + cove + 6303 =U. (3.8.19c) 


From (3.8.19a,b,c), it follows that s and v lie within the triangle in Fig. 3.35c. 
Conversely, every (heterogeneous) equilibrium state with the total specific 
entropy s and specific volume v can exist within the triangle, where c), c2, C3 
follow from (3.8.19a-c). Eqns. (3.8.19a-c) can be interpreted by the following 
center-of-gravity rule: let a point (s,v) within the triangle in the v-s diagram 
(see Fig. 3.35c) be given. The fractions cj, c2, cz must be chosen in such a 
way that attributing masses c), C2, c3 to the vertices 1, 2, 3 of the triangle 
leads to a center of gravity at the position (s,v). This can be immediately 
understood if one writes (3.8.19b,c) in the two-component form: 


se &) 7 i ane () es (") (3.8.20) 


Remarks: 


(i) Apart from the center-of-gravity rule, the linear equations can be solved 
algebraically: 


111 111 111 
S S82 83 $1 S 83 $1 52 8 
VU U2 V3 U1 U_ V3 U1, U2 U 
So \eiea-wiai? = (ooo! p= Wedd 
$1 $2 $3 $1 $2 53 $1 $2 $3 
U1 V2 V3 U1 V2 V3 U1 V2 V3 


(ii) Making use of the triple point gives a precise standard for a temperature 
and a pressure, since the coexistence of the three phases can be verified 
without a doubt. From Fig. 3.35c, it can also be seen that the triple point is 
not a point as a function of the experimentally controllable parameters, but 
rather the whole area of the triangle. The parameters which can be directly 
varied from outside the system are not P and T, but rather the volume V 
and the entropy S, which can be varied by performing work on the system or 
by transferring heat to it. If heat is put into the system at the point marked 
by a cross (Fig. 3.35c), then in the example of water, some ice would melt, 
but the state would still remain within the triangle. This explains why the 
triple point is insensitive to changes within wide limits and is therefore very 
suitable as a temperature fixed point. 


(iii) For water, T; = 273.16 K and P, = 4.58 Torr. As explained in Sect. 3.4, 
the absolute temperature scale is determined by the triple point of water. In 
order to reach the triple point, one simply needs to distill highly pure water 
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Fig. 3.36. A triple-point cell: ice, wa- 
ter, and water vapor are in equilibrium 
with each other. A freezing mixture 
in contact with the inner walls causes 
some water to freeze there. It is then 
replaced by the thermometer bulb, and 
a film of liquid water forms on the in- 
ner wall 


into a container and to seal it off after removing all the air. One then has 
water and water vapor in coexistence (coexistence region 1-2 in Fig. 3.35c). 
Removing heat by means of a freezing mixture brings the system into the 
triple-point range. As long as all three phases are present, the temperature 
equals T;, (see Fig. 3.36). 


3.9 Equilibrium in Multicomponent Systems 


3.9.1 Generalization of the Thermodynamic Potentials 


We consider a homogeneous mixture of n materials, or as one says in this 
connection, components, whose particle numbers are Ny, No,..., Nn. We 
first need to generalize the thermodynamic relations to this situation. To 
this end, we refer to Chap. 2. Now, the phase-space volume and similarly 
the entropy are functions of the energy, the volume, and all of the particle 
numbers: 


S = S(E,V,N,...,Npn). (3.9.1) 


All the thermodynamic relations can be generalized to this case by replacing 
N and ps by N; and 4; and summing over 7. We define the chemical potential 
of the 7th material by 


w= —-T (s) (3.9.2a) 
ON; E,V{Nexi} 
and, as before, 


1 ( 5B) P ( Os ) 
ae and —-= | — i (3.9.2b,c) 
ds OE VANg} T OV E,{Ne} 
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Then for the differential of the entropy, we find 


1 P ON bi 
= dE 4 dN; , 9. 
dS = 5 al 2, - (3.9.3) 
and from it the First Law 
dB =TdS — PdV + S— dN; (3.9.4) 
i=1 


for this mixture. 
Die Gibbs—Duhem relation for homogeneous mixtures reads 


B=TS—PV+)S yj. (3.9.5) 
i=1 
It is obtained analogously to Sect. 3.1.3, by differentiating 
ak = E(aS,aV,aNi,...,aNn) (3.9.6) 


with respect to a. From (3.9.4) and (3.9.5), we find the differential form of 
the Gibbs—Duhem relation for mixtures 


—SdT +VdP—S~ Nidu; =0. (3.9.7) 
1=1 


It can be seen from this relation that of the n+ 2 variables (T, P, 111,..-, Un), 
only n+ 1 are independent. 
The free enthalpy (Gibbs’ free energy) is defined by 


G=E-TS+PV. (3.9.8) 
From the First Law, (3.9.4), we obtain its differential form: 
dG =-SdT+VdP+ 5° dN; . (3.9.9) 
i=1 


From (3.9.9), we can read off 


S=-(Fr) wag? Volar) nag? = lam) 
OT) pen} BP }riy? ONi) 7p Nye} 


(3.9.10) 


For homogeneous mixtures, using the Gibbs—Duhem relation (3.9.5) we 
find for the free enthalpy (3.9.8) 


GES" aN, (3.9.11) 
t=1 
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Then we have 


n 


s=-> ($4) N; v= > (38) om. (3.9.12) 


a) 
i=1 P i=1 


The chemical potentials are intensive quantities and therefore depend only 
on T, P and the n — 1 concentrations cy = x, recigt Cy ay = Nn=t (N = 
a Ni, Ch = 1 Cisse Cn—1)- 

The grand canonical potential is defined by 


6=E-TS—-)S wNj. (3.9.13) 


i=l 


For its differential, we find using the First Law (3.9.4) 


db = —SdT — PdV —S— Nid - (3.9.14) 


i=l 


For homogeneous mixtures, we obtain using the Gibbs—Duhem relation 
(3.9.5) 


@=—-PV. (3.9.15) 


The density matrix for mixtures depends on the total Hamiltonian and will 
be introduced in Chap. 5. 


3.9.2 Gibbs’ Phase Rule and Phase Equilibrium 


We consider n chemically different materials (components), which can be in 
r phases (Fig. 3.37) and between which no chemical reactions are assumed 
to take place. The following equilibrium conditions hold: 

Temperature T and pressure P must have uniform values in the whole 
system. Furthermore, for each component 7, the chemical potential must be 
the same in each of the phases. 

These equilibrium conditions can be derived directly by considering the 
microcanonical ensemble, or also from the stationarity of the entropy. 


solid, 


Fig. 3.37. Equilibrium between 3 phases 
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(i) As a first possibility, let us consider a microcanonical ensemble consist- 
ing of n chemical substances, and decompose it into r parts. Calculating the 
probability of a particular distribution of the energy, the volume and the par- 
ticle numbers over these parts, one obtains for the most probable distribution 
the equality of the temperature, pressure and the chemical potentials of each 
component. 

(ii) As a second possibility for deriving the equilibrium conditions, one can 
start from the maximization of the entropy in equilibrium, (3.6.36b) 


1 n 
dS > (az +Pdv-S~ pd) ; (3.9.16) 


i=1 


and can then employ the resulting stationarity of the equilibrium state for 
fixed E, V, and {Nj}, 


55 =0 (3.9.17) 


with respect to virtual variations. One can then proceed as in Sect. 3.6.5, 
decomposing a system into two parts 1 and 2, and varying not only the 
energy and the volume, but also the particle numbers [see (3.6.44)]: 


(08; OS» (9S: — OSs 
oa & Tm) obi (F ma) ov 


| 0S, OS, \ 
+S > (oe = ONG ok 


a 


(3.9.18) 


Here, Nii (Ni,2) is the particle number of component 7 in the subsystem 1 
(2). 
From the condition of vanishing variation, the equality of the temperatures 
and pressures follow: 

T,=Th, PR=P (3.9.19) 


and furthermore ie = ae i.e. the equality of the chemical potentials 


Hit = Mig fort=1,...,n. (3.9.20) 


We have thus now derived the equilibrium conditions formulated at the 
beginning of this section, and we wish to apply them to n chemical substances 
in r phases (Fig. 3.37). In particular, we want to find out how many phases can 
coexist in equilibrium. Along with the equality of temperature and pressure 
in the whole system, from (3.9.20) the chemical potentials must also be equal, 


(3.9.21) 
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The upper index refers to the phases, and the lower one to the components. 
Equations (3.9.21) represent all together n(r—1) conditions on the 2+(n—1)r 
variables (T, P, cf), ash ok. risk 8”, nee ,0l”),). 

The number of quantities which can be varied (i.e. the number of degrees of 


freedom is therefore equal to f = 2+ (n —1)r—n(r —- 1): 
fS2 UA e (3.9.22) 


This relation (3.9.22) is called Gibbs’ phase rule. 

In this derivation we have assumed that each substance is present in all 
r phases. We can easily relax this assumption. If for example substance 1 
is not present in phase 1, then the condition on iw does not apply. The 
particle number of component 1 then also no longer occurs as a variable in 
phase 1. One thus has one condition and one variable less than before, and 


Gibbs’ phase rule (3.9.22) still applies.1? 


Examples of Applications of Gibbs’ Phase Rule: 


(i) For single-component system, n = 1: 


r = 1, f = 2 T, P free 
r = 2, f = 1 P=P(T) Phase-boundary curve 
r = 3, f = O Fixed point: triple point. 


(ii) An example for a two-component system, n = 2, is a mixture of sal 
ammoniac and water, NH4Cl+H20. The possible phases are: water vapor (it 
contains practically no NH4Cl), the liquid mixture (solution), ice (containing 
some of the salt), the salt (containing some H20). 


Possible coexisting phases are: 


e liquid phase: r= 1, f = 3 (variables P, T, c) 

e liquid phase + water vapor: r = 2, f = 2, variables P, T; the concentration 
is a function of P and T: c=c(P,T). 

e liquid phase + water vapor + one solid phase: r = 3, f = 1. Only one 
variable, e.g. the temperature, is freely variable. 

e liquid phase + vapor + ice + salt: r= 4, f = 0. This is the eutectic point. 


The phase diagram of the liquid and the solid phases is shown in Fig. 3.38. 
At the concentration 0, the melting point of pure ice can be seen, and at c = 1, 
that of the pure salt. Since the freezing point of a solution is lowered (see 
Chap. 5), we can understand the shape of the two branches of the freezing- 
point curve as a function of the concentration. The two branches meet at the 
eutectic point. In the regions ice-liq., ice and liquid, and in liq.-salt, liquid and 
salt coexist along the horizontal lines. The concentration of NH4Cl in the ice 


2 The number of degrees of freedom is a statement about the intensive variables; 
there are however also variations of the extensive variables. For example, at 
a triple point, f = 0, the entropy and the volume can vary within a triangle 
(Sect. 3.8.4). 
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is considerably lower than in the liquid mixture which is in equilibrium with 
it. The solid phases often contain only the pure components; then the left- 
hand and the right-hand limiting lines are identical with the two vertical lines 
at c= 0 and c= 1. At the eutectic point, the liquid mixture is in equilibrium 
with the ice and with the salt. If the concentration of a liquid is less than that 
corresponding to the eutectic point, then ice forms on cooling the system. In 
this process, the concentration in the liquid increases until finally the eutectic 
concentration is reached, at which the liquid is converted to ice and salt. The 
resulting mixture of salt and ice crystals is called the eutectic. At the eutectic 
concentration, the liquid has its lowest freezing point. 


_._ 3 : : 
liquid-salt Fig. 3.38. The phase diagram 


of a mixture of sal ammoniac 
(ammonium chloride) and wa- 
ter. In the horizontally shaded 


Dome regions, ice and liquid, liquid 
ice-salt and solid salt, and finally ice 
and solid salt coexist with each 

c other. 


The phase diagram in Fig. 3.38 for the liquid and solid phases and the 
corresponding interpretation using Gibbs’ phase rule can be applied to the fol- 
lowing physical situations: (i) when the pressure is so low that also a gaseous 
phase (not shown) is present; (ii) without the gas phase at constant pressure 
P, in which case a degree of freedom is unavailable; or (iii) in the presence 
of air at the pressure P and vapor dissolved in it with the partial pressure 
cP.!° The concentration of the vapor c in the air enters the chemical poten- 
tial as log cP (see Chap. 5). It adjusts itself in such a way that the chemical 
potential of the water vapor is equal to the chemical potential in the liquid 
mixture. It should be pointed out that owing to the term logc, the chemical 
potential of the vapor dissolved in the air is lower than that of the pure vapor. 
While at atmospheric pressure, boiling begins only at 100°C, and then the 
whole liquid phase is converted to vapor, here, even at very low temperatures 
a sufficient amount enters the vapor phase to permit the logc term to bring 
about the equalization of the chemical potentials. 

The action of freezing mixtures becomes clear from the phase diagram 3.38. 
For example, if NaCl and ice at a temperature of 0°C are brought together, then 
they are not in equilibrium. Some of the ice will melt, and the salt will dissolve 
in the resulting liquid water. Its concentration is to be sure much too high to 
be in equilibrium with the ice, so that more ice melts. In the melting process, 


13 Gibbs’ phase rule is clearly still obeyed: compared to (ii), there is one component 
(air) more and also one more phase (air-vapor mixture) present. 
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heat is taken up, the entropy increases, and thus the temperature is lowered. This 
process continues until the temperature of the eutectic point has been reached. 
Then the ice, hydrated salt, NaCl-2H2O, and liquid with the eutectic concentration 
are in equilibrium with each other. For NaCl and H2O, the eutectic temperature is 
—21°C. The resulting mixture is termed a freezing mixture. It can be used to hold 
the temperature constant at —21°C. Uptake of heat does not lead to an increase of 
the temperature of the freezing mixture, but rather to continued melting of the ice 
and dissolution of NaCl at a constant temperature. 


Eutectic mixtures always occur when there is a miscibility gap between 
the two solid phases and the free energy of the liquid mixture is lower than 
that of the two solid phases (see problem 3.28). the melting point of the 
eutectic mixture is then considerably lower than the melting points of the 
two solid phases (see Table I.10). 


3.9.3 Chemical Reactions, Thermodynamic Equilibrium and the 
Law of Mass Action 


In this section we consider systems with several components, in which the 
particle numbers can change as a result of chemical reactions. We first de- 
termine the general condition for chemical equilibrium and then investigate 
mixtures of ideal gases. 


3.9.3.1 The Condition for Chemical Equilibrium 
Reaction equations, such as for example 
2H2 + O2 = 2H20 , (3.9.23) 


can in general be written in the form 
S 0 u;Aj =0, (3.9.24) 
j=l 


where the A; are the chemical symbols and the stoichiometric coefficients v; 
are (small) integers, which indicate the participation of the components in 
the reaction. We will adopt the convention that left indicates positive and 
right negative. 

The reaction equation (3.9.24) contains neither any information about the 
concentrations at which the A; are present in thermodynamic and chemical 
equilibrium at a given temperature and pressure, nor about the direction in 
which the reaction will proceed. The change in the Gibbs free energy (= free 
enthalpy) with particle number at fixed temperature T and fixed pressure P 
for single-phase systems is!* 


™ Chemical reactions in systems consisting of several phases are treated in M.W. 
Zemansky and R.H. Dittman, Heat and Thermodynamics, Mc Graw Hill, Auck- 
land, Sixth Edition, 1987. 
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dG = S~ yjaN; . (3.9.25) 


jet 


In equilibrium, the N; must be determined in such a way that G remains 
stationary, 


SS" ujdN; = 0. (3.9.26) 
j=l 


If an amount dM participates in the reaction, then dN; = v;dM. The con- 
dition of stationarity then requires 


Savy = 0. (3.9.27) 
j=l 


For every chemical reaction that is possible in the system, a relation of this 
type holds. It suffices for a fundamental understanding to determine the 
chemical equilibrium for a single reaction. The chemical potentials 1;(T, P) 
depend not only on the pressure and the temperature, but also on the rela- 
tive particle numbers (concentrations). The latter adjust themselves in such 
a way in chemical equilibrium that (3.9.27) is fulfilled. 

In the case that substances which can react chemically are in thermal 
equilibrium, but not in chemical equilibrium, then from the change in Gibbs’ 
free energy, 


6G = 5) u;(T, P)v;M (3.9.25’) 


J 


we can determine the direction which the reaction will take. Since G is a min- 
imum at equilibrium, we must have 6G < 0; cf. Eq. (3.6.38b). The chemical 
composition is shifted towards the direction of smaller free enthalpy or lower 
chemical potentials. 


Remarks: 


(i) The condition for chemical equilibrium (3.9.27) can be interpreted to 
mean that the chemical potential of a compound is equal to the sum of the 
chemical potentials of its constituents. 


(ii) The equilibrium condition (3.9.27) for the reaction (3.9.24) holds also 
when the system consists of several phases which are in contact with each 
other and between which the reactants can pass. This is shown by the equality 
of the chemical potential of each component in all of the phases which are in 
equilibrium with each other. 

(iii) Eq. (3.9.27) can also be used to determine the equilibrium distribution 
of elementary particles which are transformed into one another by reactions. 
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For example, the distribution of electrons and positrons which are subject to 
pair annihilation, e~ +e+ = ¥, can be found (see problem 3.31). These appli- 
cations of statistical mechanics are important in cosmology, in the description 
of the early stages of the Universe, and for the equilibria of elementary- 
particle reactions in stars. 


3.9.3.2 Mixtures of Ideal Gases 


To continue the evaluation of the equilibrium condition (3.9.27), we require 
information about the chemical potentials. In the following, we consider re- 
actions in (classical) ideal gases. In Sect. 5.2, we show that the chemical 
potential of particles of type 7 in a mixture of ideal molecular gases can be 
written in the form 


Mj = f(T) +kTlogesP , (3.9.28a) 


where c; = ant holds and N is the total number of particles. The function 
f;() depends solely on temperature and contains the microscopic parameters 
of the gas of type j. From (3.9.27) and (3.9.28a), it follows that 


10 Cec ae (3.9.29) 
J 


According to Sect. 5.2, Eq. (5.2.4’) is valid: 
f(T) = €&,; — cp gT logkT — kTG; . (3.9.28b) 


Inserting (3.9.28b) into (3.9.29) yields the product of the powers of the con- 
centrations: 


<0 


BD) (uryEsersvilk p-Ess (3.9.30) 


ll? = K(T, P) = e=i¥i(G- 
Jj 


where £2, ; 18 the electronic energy, cp,; the specific heat of component 7 at 


constant pressure, and ¢; is the chemical constant 


am?! m 
; = log ——__1____ 3.9.31 

Ci og kO,. ; (2rh?)3/2 ( ) 
Here, we have assumed that 0, < T « 0,, with O, and O, the characteris- 
tic temperatures for the rotational and vibrational degrees of freedom, Eqs. 
(5.1.11) and (5.1.17). Equation (3.9.30) is the law of mass action for the con- 
centrations. The function K(T,P) is also termed the mass action constant. 
The statement that |] ; c, is a function of only T and P holds generally for 
ideal mixtures y;(T, P, {ci}) = p(T, P,c; = 1, =0( #7) + kT loge;. 
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If, instead of the concentrations, we introduce the partial pressures (see 
remark (i) at the end of this section) 


Pj =c,P, (3.9.32) 


then we obtain 


0) 
( ¢,— f2bd 
[[ 2 = Ket) =e (eB) (yy Esersn rt, (3.9.30') 
J 


the law of mass action of Guldberg and Waage!> for the partial pressures, 
with K p(T) independent of P. 
We now find e.g. for the hydrogen-oxygen reaction of Eq. (3.9.23) 


2H» + Oo — 2H20 =0, 
with 
Vy = 2 ; YOg = 1 5 VH2O0 = —2 5 (3.9.33) 


the relation 


H 2 
K(T, P) = Fea Od _ const. en a/#T TEs ersvs/® p-1 (3.9.34) 
[H20]? 
Here, the concentrations c; = [A,] are represented by the corresponding 


chemical symbols in square brackets, and we have used 
q= 2en, +, = 2€t1,0 >0, 


the heat of reaction at absolute zero, which is positive for the oxidation of 
hydrogen. The degree of dissociation a is defined in terms of the concentra- 
tions: 


a 
[H20] =1—a, [Oe] = 5? [H2] =a. 
It then follows from (3.9.32) that 
a? —@/kT MS, cp,jvj3/k p-1 
20 — a)? ~e T J dd P 5 (3.9.35) 


from which we can calculate a; a decreases exponentially with falling tem- 
perature. 


‘© The law of mass action was stated by Guldberg and Waage in 1867 on the 
basis of statistical considerations of reaction probabilities, and was later proved 
thermodynamically for ideal gases by Gibbs, who made it more specific through 
the calculation of K(T, P). 
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The law of mass action makes important statements about the conditions 
under which the desired reactions can take place with optimum yields. It 
may be necessary to employ a catalyst in order to shorten the reaction time; 
however, what the equilibrium distribution of the reacting components will 
be is determined simply by the reaction equation and the chemical potentials 
of the constituents (components) — in the case of ideal gases, by Eq. (3.9.30). 

The law of mass action has many applications in chemistry and technol- 
ogy. As just one example, we consider here the pressure dependence of the 
reaction equilibrium. From (3.9.30), it follows that the pressure derivative of 
K(T, P) is given by 


1 OK = ek = 
K OP pL 


(3.9.36a) 


where v = )°,1; is the so called molar excess. From the equation of state 
of mixtures of ideal gases (Eq. (5.2.3)), PV = kT 5°, Ni, we obtain for the 
changes AV and AN which accompany a reaction at constant T and P: 


PAV =kT SAN, . (3.9.37a) 


4 


Let the number of individual reactions be AN, i.e. AN; = 14;AN, then it 
follows from (3.9.37a) that 


1 A 
Si y= ae (3.9.37b) 


Taking AN = L (the Loschmidt/Avagadro number), then 1; moles of each 
component will react and it follows from (3.9.36a) and (3.9.37b) with the gas 
constant R that 


1 OK AV 
K OP RT’ (3.9.36b) 
Furthermore, AV = oe Vi,Vmol is the volume change in the course of the re- 
action proceeding from right to left (for a reaction which is represented in the 
form (3.9.23)). (The value of the molar volume Vino is the same for every ideal 
gas.) According to Eq. (3.9.36b) in connection with (3.9.30), a larger value 
of K leads to an increase in the concentrations c; with positive v;, i.e. of 
those substances which are on the left-hand side of the reaction equation. 
Therefore, from (3.9.36b), a pressure increase leads to a shift of the equilib- 
rium towards the side of the reaction equation corresponding to the smaller 
volume. When AV = 0, the position of the equilibrium depends only upon 
the temperature, e.g. in the hydrogen chloride reaction Hz + Cle = 2HCl. 
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In a similar manner, one finds for the temperature dependence of K(T, P) 
the result 


OT RT? ~~ RT?2° 


(3.9.38) 


Here, h; is the molar enthalpy of the substance i and Ah is the change of the 
overall molar enthalpy when the reaction runs its course one time from right 
to left in the reaction equation, c.f. problem 3.26. 

An interesting and technically important application is Haber’s synthesis 
of ammonia from nitrogen and hydrogen gas: the chemical reaction 


N + 3H) = 2NH3 (3.9.39) 


is characterized by 1N2 + 3H2 — 2NH3 = 0 (v = 0, 4% = 2): 


3 

Cy. € 
es = K(T, PY = Kp(T\P (3.9.40) 

CNHs 


To obtain a high yield of NH3, the pressure must be made as high as possible. 
Sommerfeld:'® “The extraordinary success with which this synthesis is now 
carried out in industry is due to the complete understanding of the conditions 
for thermodynamic equilibrium (Haber), to the mastery of the engineering 
problems connected with high pressure (Bosch), and, finally, to the successful 
selection of catalyzers which promote high reaction rates (Mittasch).” 


Remarks: 
(i) The partial pressures introduced in Eq. (3.9.32), P; = ¢;P, with c; = 


N,/N, in accord with the equation of state of a mixture of ideal gases, 
Eq. (5.2.3), obey the equations 


VP; = NjkT and | ae ee (3.9.41) 


(This fact is known as Dalton’s Law: the non-interacting gases in the mixture 
produce partial pressures corresponding to their particle numbers, as if they 
each occupy the entire available volume.) 

(ii) Frequently, the law of mass action is expressed in terms of the particle 
densities pj = N;/V: 


[f= (2) = (kD) =" K p(T) . (3.9.30’) 


‘6 A. Sommerfeld, Thermodynamics and Statistical Mechanics: Lectures on Theo- 
retical Physics, Vol. V (Academic Press, New York, 1956), p. 86 
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(iii) Now we turn to the direction which a reaction will take. If a mixture is 
initially present with arbitrary densities, the direction in which the reaction 
will proceed can be read off the law of mass action. Let 11,12,...,Us5 be 
positive and v.41, Vs42,---,Un negative, so that the reaction equation (3.9.24) 
takes on the form 

n 


i=st+l1 


Assume that the product of the particle densities obeys the inequality 


Il“ = aa ey A as (3.9.42) 
Jl ei 


i.e. the system is not in chemical equilibrium. If the chemical reaction pro- 
ceeds from right to left, the densities on the left will increase, and the fraction 
in the inequality will become larger. Therefore, in the case (3.9.42), the reac- 
tion will proceed from right to left. If, in contrast, the inequality was initially 
reversed, with a > sign, then the reaction would proceed from left to right. 


(iv) All chemical reactions exhibit a heat of reaction, i.e. they are accompa- 
nied either by heat release (exothermic reactions) or by taking up of heat (en- 
dothermic reactions). We recall that for isobaric processes, AQ = AH, and 
the heat of reaction is equal to the change in the enthalpy; see the comment 
following Eq. (3.1.12). The temperature dependence of the reaction equilib- 
rium follows from Eq. (3.9.38). A temperature increase at constant pressure 
shifts the equilibrium towards the side of the reaction equation where the 
enthalpy is higher; or, expressed differently, it leads to a reaction in the di- 
rection in which heat is taken up. As a rule, the electronic contribution O (eV) 
dominates. Thus, at low temperatures, the enthalpy-rich side is practically 
not present. 


*3.9.4 Vapor-pressure Increase by Other Gases 
and by Surface Tension 


3.9.4.1 The Evaporation of Water in Air 


As discussed in detail in Sect. 3.8.1, a single-component system can evaporate 
only along its vapor-pressure curve Po(T), or, stated differently, only along 
the vapor-pressure curve are the gaseous and the liquid phases in equilibrium. 
If an additional gas is present, this means that there is one more degree of 
freedom in Gibbs’ phase rule, so that a liquid can coexist with its vapor even 
outside of Po(T). 

Here, we wish to investigate evaporation in the presence of additional 
gases and in particular that of water under an air atmosphere. To this end 
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we assume that the other gas is dissolved in the liquid phase to only a neg- 
ligible extent. If the chemical potential of the liquid were independent of the 
pressure, then the other gas would have no influence at all on the chemical 
potential of the liquid; the partial pressure of the vapor would then have 
to be identical with the vapor pressure of the pure substance — a statement 
which is frequently made. In fact, the total pressure acts on the liquid, which 
changes its chemical potential. The resulting increase of the vapor pressure 
will be calculated here. 
To begin, we note that 


OL =, V 


is small, owing to the small specific volume vy, = + of the liquid. When 
the pressure is changed by AP, the chemical potential of the liquid changes 
according to 


p(T, P+ AP) = uw (T, P) + vx. AP + O(AP’). (3.9.44) 
From the Gibbs—Duhem relation, the chemical potential of the liquid is 
by =e, —-Tsyt+ Pu. (3.9.45) 


Here, ey and sy, refer to the internal energy and the entropy per particle. 
When we can neglect the temperature and pressure dependence of ey, sz, 
and vz, then (3.9.44) is valid with no further corrections. 

The chemical potential of the vapor, assuming an ideal mixture!”, is 


Lvapor(T’, P) = po(T) + kT log cP , (3.9.46) 
where c is the concentration of the vapor in the gas phase, c = ,>——3>-_ Nesp = 


The vapor-pressure curve Po(T) without additional gases follows from 
ui(T, Po) = o(T) + kT log Po . (3.9.47) 


With an additional gas, the pressure is composed of the pressure of the other 
gas Pother and the partial pressure of the vapor, Pyapor = cP; all together, 
P = Pother + Prapor- Then the equality of the chemical potentials in the liquid 
and the gaseous phases is expressed by 


ur(T, Pother + Prapor) = to (T) + kT log Pyapor . 


Subtracting (3.9.47) from this, we find 


1T See Sect. 5.2 
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Po 

Pra; or — Po 

pepe 2 eee ee 
Po 


Pra or Pi 
(TL, Pother + Prapor) ~ ui(T; Po) = kT log (ze + 1) 


UL (Poin + Pyapor _ Po) ~ 


kT 
UL Pother = (= = «.) (Prapor = Po) 


Pother 
Pyapor Po =. suveth = aD (P Prapor) : (3.9.48) 
UG — UL UG — UL 


From the second term in Eq. 3.9.48, it follows that the increase in vapor 
pressure is given approximately by Pyapor — Po(T) ¥ ae other, and the exact 
expression is found to be 

Poe RTs ae =< PTV (3.9.49) 
The partial pressure of the vapor is increased relative to the vapor-pressure 
curve by *4x (P—P(T’)). Due to the smallness of the factor 7+, the partial 
pressure is still to a good approximation the same as the vapor pressure at 
the temperature T’. The most important result of these considerations is the 
following: while a liquid under the pressure P at the temperature T is in 
equilibrium with its vapor phase only for P = Po(T); that is, for P > Po(T) 
(or at temperatures below its boiling point) it exists only in liquid form, it is 
also in equilibrium in this region of (P,T’) with its vapor when dissolved in 
another gas. 
We now discuss the evaporation of water or the sublimation of ice under an atmo- 
sphere of air, see Fig. 3.39. The atmosphere predetermines a particular pressure 
P. At each temperature T below the evaporation temperature determined by this 
pressure (P > Po(T)), just enough water evaporates to make its partial pressure 
equal that given by (3.9.49) (recall Pyapor = cP). The concentration of the water 
vapor is c = (Po(T) 4 oe (P — Po(T)))/P. 

In a free air atmosphere, the water vapor is transported away by diffusion or 
by convection (wind), and more and more water must evaporate (vaporize).'® On 


Fig. 3.39. The vapor pressure Pyapor 
lies above the vapor-pressure curve 
Po(T) (dot-dashed curve) 


18 As already mentioned, the above considerations are also applicable to sublima- 
tion. When one cools water at 1 atm below 0°C, it freezes to ice. This ice at 
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increasing the temperature, the partial pressure of the water increases, until finally 
it is equal to P. The vaporization which then results is called boiling. 

For P = Po(T), the liquid is in equilibrium with its pure vapor. Evaporation 
then occurs not only at the liquid surface, but also within the liquid, in particular 
at the walls of its container. There, bubbles of vapor are formed, which then rise to 
the surface. Within these vapor bubbles, the vapor pressure is Po(T'), corresponding 
to the temperature T’. Since the vapor bubbles within the liquid are also subject to 
the hydrostatic pressure of the liquid, their temperature must in fact be somewhat 
higher than the boiling point under atmospheric pressure. If the liquid contains 
nucleation centers (such as the fat globules in milk), at which vapor bubbles can 
form more readily than in the pure liquid, then it will “boil over”. 

The increase in the vapor pressure by increased external pressure, or as one 
might say, by ‘pressing on it’, may seem surprising. The additional pressure causes 
an increase in the release of molecules from the liquid, i.e. an increase in the partial 
pressure. 


3.9.4.2 Vapor-Pressure Increase by Surface Tension of Droplets 


A further additional pressure is due to the surface tension and plays a role 
in the evaporation of liquid droplets. We consider a liquid droplet of radius r. 
When the radius is increased isothermally by an amount dr, the surface area 
increases by 8ar dr, which leads to an energy increase of o8ardr, where a is 
the surface tension. Owing to the pressure difference p between the pressure 
within the droplet and the pressure of the surrounding atmosphere, there is 
a force p4ar? which acts outwardly on the surface. The total change of the 
free energy is therefore 


dF = 6A = o8nrdr — p4nr* dr . (3.9.50) 


In equilibrium, the free energy of the droplet must be stationary, so that for 
the pressure difference we find the following dependence on the radius: 


_ 20 


p (3.9.51) 


rp 
Thus, small droplets have a higher vapor pressure than larger one. The vapor- 
pressure increase due to the surface tension is from Eq. (3.9.48) now seen to 
be 

= 20 UL 


Pyapor =~ Po(T) = = paar (3.9.52) 


inversely proportional to the radius of the droplet. In a mixture of small and 
large droplets, the smaller ones are therefore consumed by the larger ones. 


e.g. —10°C is to be sure as a single-component system not in equilibrium with 
the gas phase, but rather with the water vapor in the atmosphere at a partial 
pressure of about Po(—10°C), where Po(7’) represents the sublimation curve. For 
this reason, frozen laundry dries, because ice sublimes in the atmosphere. 
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Remarks: 


(i) Small droplets evaporate more readily than liquids with a flat surface, 
and conversely condensation occurs less easily on small droplets. This is the 
reason why extended solid cooled surfaces promote the condensation of wa- 
ter vapor more readily than small droplets do. The temperature at which 
the condensation of water from the atmosphere onto extended surfaces (dew 
formation) takes place is called the dew point. It depends on the partial pres- 
sure of water vapor in the air, i.e. its degree of saturation, and can be used 
to determine the amount of moisture in the air. 

(ii) We consider the homogeneous condensation of a gas in free space without 
surfaces. The temperature of the gas is taken to be T and the vapor pressure 
at this temperature to be Po(T'). We assume that the pressure P of the gas is 
greater than the vapor pressure; it is then referred to as supersaturated vapor. 
For each degree of supersaturation, then, a critical radius can be defined from 
(3.9.52): 


_. UE. 20 
~ ug (P— Po(T)) 


Ter 


For droplets whose radius is smaller than r,, the vapor is not supersaturated. 
Condensation can therefore not take place through the formation of very 
small droplets, since their vapor pressures would be higher than P. Some 
critical droplets must be formed through fluctuations in order that condensa- 
tion can be initiated. Condensation is favored by additional attractive forces; 
for example, in the air, there are always electrically-charged dust particles and 
other impurities present, which as a result of their electrical forces promote 
condensation, i.e. they act as nucleation centers for condensation. 


Problems for Chapter 3 


3.1 Read off the partial derivatives of the internal energy E with respect to its 
natural variables from Eq. (3.1.3). 


3.2 Show that 
dg = adx + Body 


is not an exact differential: a) using the integrability conditions and b) by integra- 
tion from P; to P2 along the paths C; and C2. Show that 1/x is an integrating 
factor, df = dg/z. 


3.3 Prove the chain rule (3.2.13) for Jacobians. 


3.4 Derive the following relations: 


Cp = KT OT _ T OP and OT = T OV 
Cy Ks : OV s Cy OT Vv OP s Cp OT ae 
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x Fig. 3.40. Paths in the z-y diagram 


3.5 Determine the work performed by an ideal gas, W(V) = i dV P during a 
reversible adiabatic expansion. From dQ = 0, it follows that dE = —PdV, and from 
this the adiabatic equations for an ideal gas can be obtained: T’ = ra)” 2 and 


/ 
P = NkT,“% 


2/3 
7373: They can be used to determine the work performed. 


3.6 Show that the stability conditions (3.6.48a,b) follow from the maximalization 
of the entropy. 


3.7 One liter of an ideal gas expands reversibly and isothermally at (20°C) from 
an initial pressure of 20 atm to 1 atm. How large is the work performed in Joules? 
What quantity of heat @ in calories must be transferred to the gas? 


3.8 Show that the ratio of the entropy increase on heating of an ideal gas from T 
to Tz at constant pressure to that at constant volume is given by the ratio of the 
specific heats. 


3.9 A thermally insulated system is supposed to consist of 2 subsystems (T'4, Va, P) 
and (Tz, Ve, P), which are separated by a movable, diathermal piston (Fig. 3.41(a). 
The gases are ideal. 

(a) Calculate the entropy change accompanying equalization of the temperatures 
(irreversible process). 

(b) Calculate the work performed in a quasistatic temperature equalization; 
cf. Fig. 3.41(b). 


Fig. 3.41. For problem 3.9 


3.10 Calculate the work obtained, W = § PdV, in a Carnot cycle using an ideal 
gas, by evaluating the ring integral. 


3.11 Compare the cooling efficiency of a Carnot cycle between the temperatures 
T, and T2 with that of two Carnot cycles operating between T; and T3 and between 
T3 and Tp (Ti < T3 < T2). Show that it is more favorable to decompose a cooling 
process into several smaller steps. 


162 3. Thermodynamics 


3.12 Discuss a Carnot cycle in which the working ‘substance’ is thermal radiation. 
For this case, the following relations hold: E = 0oVT*, pV = sE, o> 0. 
(a) Derive the adiabatic equation. (b) Compute Cy and Cp. 


3.13 Calculate the efficiency of the Joule cycle (see Fig. 3.42): 
Result : 7 = 1—(Po/Pi)“~?/* 


Compare this efficiency with that of the Carnot cycle (drawn in dashed lines), using 
an ideal gas as working substance. 


P adiabats 
: 
P 
Vv Fig. 3.42. The Joule cycle 


3.14 Calculate the efficiency of the Diesel cycle (Fig. 3.43) Result: 


ed 1 (V2/Vi)* — (V3/Vi)* 
kK (V2/Vi) — (V3/M1) 


adiabats 


Fig. 3.43. The Diesel cycle 


3.15 Calculate for an ideal gas the change in the internal energy, the work per- 
formed, and the quantity of heat transferred for the quasistatic processes along the 
following paths from 1 to 2 (see Fig. 3.44) 

(a) 1-A-2 

(b) 1-B-2 

(c) 1-C-2. What is the shape of the E(P,V) surface? 


> isochore — 


isobaric 
0 Vv Fig. 3.44. For problem 3.15 
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3.16 Consider the socalled Stirling cycle, where a heat engine (with an ideal gas 
as working substance) performs work according to the following quasistatic cycle: 
(a) isothermal expansion at the temperature T; from a volume V, to a volume V2. 
(b) cooling at constant volume V2 from T; to To. 

(c) isothermal compression at the temperature T2 from V2 to Vi. 

(d) heating at constant volume from T> to 7}. 

Determine the thermal efficiency 7 of this process! 


3.17 The ratio of the specific volume of water to that of ice is 1.000:1.091 at 0°C 
and 1 atm. The heat of melting is 80 cal/g. Calculate the slope of the melting curve. 


3.18 Integrate the Clausius-Clapeyron differential equation for the transition 
liquid-gas, by making the simplifying assumption that the heat of transition is 
constant, Viiquia can be neglected in comparison to Vgas, and that the equation of 
state for ideal gases is applicable to the gas phase. 


3.19 Consider the neighborhood of the triple point in a region where the limiting 
curves can be approximated as straight lines. Show that a < 7m holds (see Fig. 3.45). 
Hint: Use dP/dT = AS/AV, and the fact that the slope of line 2 is greater than 
that of line 3. 


Fig. 3.45. The vicinity of a triple point 


3.20 The latent heat of ice per unit mass is Qr. A container holds a mixture 
of water and ice at the freezing point (absolute temperature To). An additional 
amount of the water in the container (of mass m) is to be frozen using a cooling 
apparatus. The heat output from the cooling apparatus is used to heat a body of 
heat capacity C' and initial temperature To. What is the minimum quantity of heat 
energy transferred from the apparatus to the body? (Assume C’ to be temperature 
independent). 


3.21 (a) Discuss the pressure dependence of the reaction No2+3H2 = 2NH3 (am- 
monia synthesis). At what pressure is the yield of ammonia greatest? 

(b) Discuss the thermal dissociation 2H20 = 2H2+Oz2. Show that an increase in 
pressure works against the dissociation. 

3.22 Give the details of the derivation of Eqs. (3.9.36a) and (3.9.36b). 

3.23 Discuss the pressure and temperature dependence of the reaction 


CO + 3He = CH4+ HeO. 


3.24 Apply the law of mass action to the reaction H2+Clz = 2HCI. 
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3.25 Derive the law of mass action for the particle densities 
pj =Nj/V (Eq. (3.9.30')) . 


3.26 Prove Eq. (3.9.38) for the temperature dependence of the mass-action con- 
stant. 


Hint: Show that H =G—T82 =T? (2) 
and express the change in the free enthalpy 


AG = ye bili 


using Eq. (3.9.28), then insert the law of mass action (3.9.30) or (3.9.30’). 


3.27 The Pomeranchuk effect. The entropy diagram for solid and liquid He? has 
the shape shown below 3 K. Note that the specific volumes of both phases do not 
change within this temperature range. Draw P(T) for the coexistence curves of the 
phases. 


Fig. 3.46. The Pomeranchuk effect 


3.28 The (specific) free energies fa and fg of two solid phases a and @ with a 
miscibility gap and the (specific) free energy fz of the liquid mixture are shown as 
functions of the concentration c in Fig. 3.47. 

Discuss the meaning of the dashed and solid double tangents. On lowering 
the temperature, the free energy of the liquid phase is increased, i.e. fr is shifted 
upwards relative to the two fixed branches of the free energy. Derive from this the 
shape of the eutectic phase diagram. 


Fig. 3.47. Liquid mixture 
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3.29 A typical shape for the phase diagram of liquid and gaseous mixtures is shown 
in Fig. 3.48. 

The components A and B are completely miscible in both the gas phase and the 
liquid phase. B has a higher boiling point than A. At a temperature in the interval 
Ta < T < Tp, the gas phase is therefore richer in A than the liquid phase. Discuss 
the boiling process for the initial concentration co 
(a) in the case that the liquid remains in contact with the gas phase: show that 
vaporization takes place in the temperature interval To to Te. 

(b) in the case that the vapor is pumped off: show that the vaporization takes place 
in the interval To to Tp. 


0 
A B 


Fig. 3.48. Bubble point and dew point lines 


Remark: The curve which is made by the boiling curve (evaporation limit) and 
the condensation curve together form the bubble point and dew point lines, a lens- 
shaped closed curve. Its shape is of decisive importance for the efficiency of distilla- 
tion processes. This ‘boiling lens’ can also take on much more complex shapes than 
in Fig. 3.48, such as e.g. that shown in Fig. 3.49. A mixture with the concentration 
Ca is called azeotropic. For this concentration, the evaporation of the mixture oc- 
curs exactly at the temperature J, and not in a temperature interval. The eutectic 
concentration is also special in this sense. Such a point occurs in an alcohol-water 
mixture at 96%, which limits the distillation of alcohol.'® 


I 
gas 
liquid 
py | ee 
0 Ca 1 Fig. 3.49. Bubble point and dew point lines 


18 Detailed information about phase diagrams of mixtures can be found in 
M. Hansen, Constitution of Binary Alloys, McGraw Hill, 1983 und its supple- 
ments. Further detailed discussions of the shape of phase diagrams are to be 
found in L. D. Landau and E.M. Lifshitz, Course of Theoretical Physics, Vol. V, 
Statistical Physics, Pergamon Press 1980. 
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3.30 The free energy of the liquid phase, fz, is drawn in Fig. (3.50) as a function 
of the concentration, as well as that of the gas phase, fg. It is assumed that fr 
is temperature independent and fg shifts upwards with decreasing temperature 
(Fig. 3.50). Explain the occurrence of the ‘boiling lens’ in problem 3.29. 


Fig. 3.50. Free energy 


3.31 Consider the production of electron-positron pairs, 
efte = An 


Assume for simplicity that the chemical potential of the electrons and positrons 
is given in the nonrelativistic limit, taking the rest energy into account, by w = 


me + kT log aN Show that for the particle number densities n+ of e~ that 


2me2 


N4n- = Se kT 


holds and discuss the consequences. 


3.32 Consider the boiling and condensation curves of a two-component liquid mix- 
ture. Take the concentrations in the gaseous and liquid phases to be cq and cr. 
Show that at the points where cq = cr (the azeotropic mixture) i.e. where the 
boiling and condensation curves come together, for a fixed pressure P the following 
relation holds: 


dT 

ie |) 

dc , 
and for fixed T 

dP 

geo 


thus the slopes are horizontal. 
Method: Start from the differential Gibbs-Duhem relations for the gas and the 
liquid phases along the limiting curves. 
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3.33 Determine the temperature of the atmosphere as a function of altitude. How 
much does the temperature decrease per km of altitude? Compare your result for 
the pressure P(z) with the barometric formula (see problem 2.15). 

Method: Start with the force balance on a small volume of air. That gives 


Assume that the temperature changes depend on the pressure changes of the air 


(ideal gas) adiabatically ae = wt ee) From this, one finds aT(2) 


. Numerical 
values: m = 29 g/mole, y = 1.41. 


3.34 In meteorology, the concept of a “homogeneous atmosphere” is used, where 
p is taken to be constant. Determine the pressure and the temperature in such an 
atmosphere as functions of the altitude. Calculate the entropy of the homogeneous 
atmosphere and compare it with that of an isothermal atmosphere with the same 
energy content. Could such a homogeneous atmosphere be stable? 


4. Ideal Quantum Gases 


In this chapter, we want to derive the thermodynamic properties of ideal 
quantum gases, i.e. non-interacting particles, on the basis of quantum statis- 
tics. This includes nonrelativistic fermions and bosons whose interactions may 
be neglected, quasiparticles in condensed matter, and relativistic quanta, in 
particular photons. 


4.1 The Grand Potential 


The calculation of the grand potential is found to be the most expedient way 
to proceed. In order to have a concrete system in mind, we start from the 
Hamiltonian for N non-interacting, nonrelativistic particles, 


N 1 ; 
H=) i 5oPi (4.1.1) 
t=1 


We assume the particles to be enclosed in a cube of edge length LZ and vol- 
ume V = L?, and apply periodic boundary conditions. The single-particle 
eigenfunctions of the Hamiltonian are then the momentum eigenstates |p) 
and are given in real space by 


Yp(x) = (x|p) = = een, (4.1.2a) 


where the momentum quantum numbers can take on the values 


27h 
p= (11, 2,0) 5 Va = 0,+1,... 5 (4.1.2b) 


and the single-particle kinetic energy is given by 


2 
Pp 
= ; 4.1.2 
Ep mM ( c) 
For the complete characterization of the single-particle states, we must still 
take the spin s into account. It is integral for bosons and half-integral for 


fermions. The quantum number m, for the z-component of the spins has 
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2s+1 possible values. We combine the two quantum numbers into one symbol, 
p =(p,m,) and find for the complete energy eigenstates 


Ip) = |p) |ms) . (4.1.2d) 


In the treatment which follows, we could start from arbitrary non- 
interacting Hamiltonians, which can also contain a potential and can depend 
on the spin, as is the case for electrons in a magnetic field. We then still 
denote the single-particle quantum numbers by p and the eigenvalue belong- 
ing to the energy eigenstate |p) by ¢,, but it need no longer be the same as 
(4.1.2c). These states form the basis of the N-particle states for bosons and 
fermions: 


pis P2,---,PN) =NS0(41)” P|pi)... |p) - (4.1.3) 
P 


Here, the sum runs over all the permutations P of the numbers 1 to N. 
The upper sign holds for bosons, (+1)? = 1, the lower sign for fermions. 
(—1)? is equal to 1 for even permutations and —1 for odd permutations. The 
bosonic states are completely symmetric, the fermionic states are completely 
antisymmetric. As a result of the symmetrization operation, the state (4.1.3) 
is completely characterized by its occupation numbers n,, which indicate how 
many of the N particles are in the state |p). For bosons, n, = 0,1,2,... can 
assume all integer values from 0 to oo. These particles are said to obey Bose— 
Einstein statistics. For fermions, each single-particle state can be occupied at 
most only once, n, = 0,1 (identical quantum numbers would yield zero due 
to the antisymmetrization on the right-hand side of (4.1.3)). Such particles 
are said to obey Fermi—Dirac statistics. The normalization factor in (4.1.3) 
is NV = —\ for fermions and N = (N! np,!np,!...)~1/? for bosons.! 


VN! 
For an N-particle state, the sum of all the np obeys 


ee ge one (4.1.4) 


and the energy eigenvalue of this N-particle state is 


Eli, i= Se ripen (4.1.5) 


We can now readily calculate the grand partition function (Sect. 2.7.2): 


' Note: for bosons, the state (4.1.3) can also be written in the form 
(N1/np,!npy!...)71/? > p P’ \pi)...|pw), where the sum includes only those 
permutations P’ which lead to different terms. 
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Za= S- S- eo BECK })-EN) — ys eB Up Ep-H) rp 
N=0 {np} {np} 
ip Mp=N 
1 
——._——~ for bosons 
{= e7Blep—H) 
= II bs e Blep—H)tp — Pp (4.1.6) 
Pp Np II (1 + ePler-H) | for fermions . 


Pp 
We give here some explanations relevant to (4.1.6). Here, >> inp} > = 
Dl ae ... refers to the multiple sum over all occupation numbers, whereby 
each occupation number n, takes on the allowed values (0,1 for fermions and 
0,1,2, ... for bosons). In this expression, p = (p,ms) runs over all values 
of p and m,. The calculation of the grand partition function requires that 
one first sum over all the states allowed by a particular value of the particle 
number N, and then over all particle numbers, N = 0,1,2,.... In the defi- 
nition of ZG, )¢,,, therefore enters with the constraint }/,,p = N. Since 
however in the end we must sum over all N , the expression after the second 
equals sign is obtained; in it, the sum runs over all n, independently of one 
another. Here, we see that it is most straightforward to calculate the grand 
partition function as compared to the other ensembles. For bosons, a product 
of geometric series is obtained in (4.1.6); the condition for their convergence 
requires that fz < €p for all p. 
The grand potential follows from (4.1.6): 


$= —-6"'logZg=+6"' S- log (1 + ele) 5 (4.1.7) 
P 


from which we can derive all the thermodynamic quantities of interest. Here, 
and in what follows, the upper (lower) signs refer to bosons (fermions). For 
the average particle number, we therefore find 


ae CG) = > nlep) , (4.1.8) 


Pp 
where we have introduced 
1 


eB (Eph) ae | (4.1.9) 


n(Ep) = 


these are also referred to as the Bose or the Fermi distribution functions. We 
now wish to show that n(e,) is the average occupation number of the state 
\q). To this end, we calculate the average value of ng: 


yee e FX, len, ae e Bnalea-H) 2, 


(nq) = Tr(pang) = a eo Fhe eT — ae =D 


0 —x“n 
=~ Fhe ye = nea) 


r=B(eq—pH) 
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which demonstrates the correctness of our assertion. We now return to the 
calculation of the thermodynamic quantities. For the internal energy, we find 
from (4.1.7) 


a POL@E) «2 
E= (“Sr ) 4, = Lenten (4.1.10) 


where in taking the derivative, the product Gy is held constant. 
Remarks: 


(i) In order to ensure that n(e,) > 0 for every value of p, for bosons we require 
that «4 <0, and for an arbitrary energy spectrum, that 4 < min(¢,). 
(ii) For e~®@»—-) <1 and s = 0, we obtain from (4.1.7) 


—_p-1 “Her = Bye —Bp?/2m _ _ 2V_ 
ia Beane | °P° Bx’ 


(4.1.11) 


which is identical to the grand potential of a classical ideal gas, 
Eq. (2.7.23). Here, the dispersion relation ¢, = p?/2m from Eq. (4.1.2c) 
was used for the right-hand side of (4.1.11). In 


Zoe", (4.1.12) 


we have introduced the fugacity, and 4 = Tea (Eq. (2.7.20)) denotes 
the thermal wavelength. For s 4 0, an additional factor of (2s+1) would 
occur after the second and third equals signs in Eq. (4.1.11). 

(iii) The calculation of the grand partition function becomes even simpler if 


we make use of the second-quantization formalism 
Zg = Tr exp(—B(H — uN)) , (4.1.13a) 


where the Hamiltonian and the particle number operator in second quan- 
tization? have the form 


H=) e,aha, (4.1.13b) 
Pp 
and 
Wes alas s (4.1.13¢) 
Pp 
It then follows that 
Zq = Tr] Je Per Mart = TT S- ec Plerwne (4.1.13) 
Pp Pp Np 


and thus we once again obtain (4.1.6). 


? See e.g. F. Schwabl, Advanced Quantum Mechanics, 3"¢ ed. (QM II), Springer, 
2005, Chapter 1. 
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According to Eq. (4.1.2b) we may associate with each of the discrete p values 
a volume element of size A = 27h/L°. Hence, sums over p may be replaced 
by integrals in the limit of large V. For the Hamiltonian of free particles 
(4.1.1), this implies in (4.1.7) and (4.1.8) 


a 0304 aye sf be... (4.1.14a) 


Pp 


with the degeneracy factor 
g=2s+1, (4.1.14b) 


as a result of the spin-independence of the single-particle energy ép. 
For the average particle number, we then find from (4.1.8)? 


4 V ° 
N= g - d°pn(ép) = pulaes dp p’n(€p) 
(27h) 27h 
0 
- gV m3? i de é 
~ 91/27273 | eBle—M) 1” (4.1.15) 
0 


where we have introduced ¢ = p?/2m as integration variable. We also define 
the specific volume 


v=V/N (4.1.16) 


and substitute x = (Ge, finally obtaining from (4.1.15) 


(4.1.17) 


1 i Be i gil? _ 9 § 93/2(2) for bosons 
vi Br etz-141 f3/2(z) for fermions . 
0 


In this expression, we have introduced the generalized ¢-functions, which are 
defined by# 


tt ao 
f(z) 


Similarly, from (4.1.7), we find 


0 


3 For bosons, we shall see in Sect. 4.4 that in a temperature range where jz — 0, 
the term with p = 0 must be treated separately in making the transition from 
the sum over momenta to the yee 


4 The gamma function is defined as ['(v =f dte‘t’-' [Rev > OQ]. It obeys the 
relation (vy +1) =vI(v). 


174 4. Ideal Quantum Gases 


ae Cra / d*p log (1 +E cere) 


(4.1.19) 
2 Vin? —B(e- 
me he Sata | eve (1 =e : Hy 
which, after integration by parts, leads to 
2 gVm3/2 Pf deed! VET { 95/9(z) 
Spy = us 4.1.19! 
3 fas | sen z1 8 foj2(z) ’ ( 9) 
0 


where the upper lines holds for bosons and the lower line for fermions. The 
expression (3.1.26), ® = —PV, which is valid for homogeneous systems, was 
also used here. From (4.1.10) we obtain for the internal energy 


gVm3/? f  dee3/? 
fanny af a DEpn(Ep) = 5172,298 | GAle-w 1° (4.1.20) 
0 
Comparison with (4.1.19’) yields, remarkably, the same relation 
2 
PV =3E (4.1.21) 


as for the classical ideal gas. Additional general relations follow from the 
homogeneity of & in T and p. From (4.1.19’), (4.1.15), and (3.1.18), we obtain 


oa Jf _ _ 5/2 Lt 2 3/2 Lt 
P= wet Tp () ,N=VT'*n () : (4.1.22a,b) 
_ O® _ 3/2. ( S — s(u/T) 


Using these results, we can readily derive the adiabatic equation. The con- 
ditions S = const. and N = const., together with (4.1.22d), (4.1.22b) and 
(4.1.22a), yield /T = const., VT?/? = const., PT—*/? = const., and finally 


PV*/3 — const . (4.1.23) 


The adiabatic equation has the same form as that for the classical ideal 
gas, although most of the other thermodynamic quantities show different 
behavior, such as for example cp/cy 45/3. 

Following these preliminary general considerations, we wish to derive the 
equation of state from (4.1.22a). To this end, we need to eliminate /T from 
(4.1.22a) and replace it by the density N/V using (4.1.22b). The explicit 
computation is carried out in 4.2 for the classical limit, and in 4.3 and 4.4 
for low temperatures where quantum effects predominate. 


4.2 The Classical Limit z= e"/"F <1 175 


4.2 The Classical Limit z = e#4/*T <1 


We first formulate the equation of state in the nearly-classical limit. To do 
this, we expand the generalized ¢-functions g and f defined in (4.1.18) as 
power series in z: 


g(2) = 1 i v-1,-@ ~ 14)\k’ —ak! yk _ . Geljrreee 
aa = TW es pie re > ae , 
(4.2.1) 


where the upper lines (signs) hold for bosons and the lower for fermions. 
Then Eq. (4.1.17) takes on the form 


Ae (+1 z Zz 


UV 
k=1 


This equation can be solved iteratively for z: 


3 1 3\ 2 3\3 
Z= ane z + O 2 : (4.2.3) 
ug 23/2 ug v 
Inserting this in the series for which follows from (4.1.19’) and (4.2.1), 


VkT 2 
P= g x3 (: oe 35/2 + o(:')) : (4.2.4) 


we can eliminate jz in favor of N and obtain the equation of state 


3 Ao \ 7 
PV = -$= NkT (14 35 +0 (=) (4.2.5) 
gu v 


The symmetrization (antisymmetrization) of the wavefunctions causes a re- 
duction (increase) in the pressure in comparison to the classical ideal gas. 
This acts like an attraction (repulsion) between the particles, which in fact 
are non-interacting (formation of clusters in the case of bosons, exclusion 
principle for fermions). For the chemical potential, we find from (4.1.12) and 


(4.2.3), and making use of x <1, the following expansion: 


3 1 3 
a dll 5 (4.2.6) 


p= kTlogz = kT ee oo 37 go’ 


i.e. u <0. Furthermore, for the free energy F = + wN, we find from (4.2.5) 
and (4.2.6) 


Nd» 


f= Faass + bE SS Pagy ’ 


(4.2.7a) 
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where 
Xe 
Fass = NkT (1 + log ~) (4.2.7b) 
gu 


is the free energy of the classical ideal gas. 
Remarks: 


(i) The quantum corrections are proportional to h?, since \ is proportional 
to h. These corrections are also called exchange corrections, as they de- 
pend only on the symmetry behavior of the wavefunctions (see also Ap- 
pendix B). 

(ii) The exchange corrections to the classical results at finite temperatures 
are of the order of \?/v. The classical equation of state holds for z< 1 
or \ < v'/3, ie. in the extremely dilute limit. This limit is the more 
readily reached, the higher the temperature and the lower the density. 
The occupation number in the classical limit is given by (cf. Fig. 4.1) 


3 
n(Ep) we Bepehh — ase <1. (4.2.8) 


This classical limit (4.2.8) is equally valid for bosons and fermions. For 
comparison, the Fermi distribution at J’ = 0 is also shown. Its signifi- 
cance, as well as that of er, will be discussed in Sect. 4.3 (Fig. 4.1). 
(iii) Corresponding to the symmetry-dependent pressure change in (4.2.5), 
the exchange effects lead to a modification of the free energy (4.2.7a). 


n (€) 


Fig. 4.1. The occu- 
pation number n(e) 
in the classical limit 
(shaded). For com- 
parison, the occupa- 
tion of a degenerate 
Fermi gas is also indi- 
cated 


4.3 The Nearly-degenerate Ideal Fermi Gas 


In this and the following section, we consider the opposite limit, in which 
quantum effects are predominant. Here, we must treat fermions and bosons 
separately in Sect. 4.4. We first recall the properties of the ground state of 
fermions, independently of their statistical mechanics. 
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4.3.1 Ground State, T = 0 (Degeneracy) 


We first deal with the ground state of a system of N fermions. It is obtained 
at a temperature of zero Kelvin. In the ground state, the N lowest single- 
particle states |p) are each singly occupied. If the energy depends only on 
the momentum p, every value of p occurs g-fold. For the dispersion relation 
(4.1.2c), all the momenta within a sphere (the Fermi sphere), whose radius 
is called the Fermi momentum pr (Fig. 4.2), are thus occupied. The particle 
number is related to pr as follows: 


V Vp3 
N=9 So 1=9 [@rer-»)= 8 ay (4.3.1) 


DSPF 


(27h)?  6n2h3 


Fig. 4.2. The occupation of the momen- 
tum states within the Fermi sphere 


From (4.3.1), we find the following relation between the particle density 


n= x and the Fermi momentum: 
6 2\ 1/3 
pr= (=) ani | (4.3.2) 
g 


The single-particle energy corresponding to the Fermi momentum is called 
the Fermi energy: 


2 Gr2\ 2/2 72 
spe ={ i ) n2/3 | (4.3.3) 
2m g 2m 


For the ground-state energy, we find 


V 2 Vp, 3 
B= oe | ep > O(pr gy oe RE Ae el Ns (4.3.4) 
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From (4.1.21) and (4.3.4), the pressure of fermions at T’ = 0 is found to be 


2 1 (6n2\7/? 72 
P= zepn= =( ; ) ie (4.3.5) 


The degeneracy of the ground state is sufficiently small that the entropy and 
the product TS vanish at T = 0 (see also (4.3.19)). From this, and using 
(4.3.4) and (4.3.5), we obtain for the chemical potential using the Gibbs— 
Duhem relation p = 4(E + PV —TS): 


ser. (4.3.6) 


This result is also evident from the form of the ground state, which implies 
the occupation of all the levels up to the Fermi energy, from which it follows 
that the Fermi distribution of a system of N fermions at T = 0 becomes 
n(e) = O(er — «€). Clearly, one requires precisely the energy €r in order to 
put one additional fermion into the system. The existence of the Fermi energy 
is a result of the Pauli principle and is thus a quantum effect. 


4.3.2 The Limit of Complete Degeneracy 


We now calculate the thermodynamic properties in the limit of large u/kT. 
In Fig. 4.3, the Fermi distribution function 


1 


ne) = ge-m)/kT 4 1 (4.3.7) 


is shown for low temperatures. In comparison to a step function at the posi- 
tion py, it is broadened within a region kT’. We shall see below that ps is equal 
to er only at T = 0. For T = 0, the Fermi distribution function degenerates 
into a step function, so that one then speaks of a degenerate Fermi gas; at 
low T one refers to a nearly-degenerate Fermi gas. 

It is expedient to replace the prefactors in (4.1.19’) and (4.1.15) with the 
Fermi energy (4.3.3)°; for the grand potential, one then obtains 


f= “News i. de €9/? n(e) , (4.3.8) 
0 


and the formula for N becomes 


1= ser’? f de eV? n(e) . (4.3.9) 
0 


° In (4.3.8) and (4.3.14), ® is expressed as usual in terms of its natural variables 


T, V and yp, since Ne,” x V. In (4.3.14), the dependence on ys has been 
substituted by T and N/V, using (4.3.13). 
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Fig. 4.3. The Fermi distribu- 
tion function n(e) for low tem- 
peratures, compared with the step 
function O(p — €). 


Fig. 4.4. The Fermi dis- 
tribution function n(e), and 


n(e) — O(uw—e). 
There thus still remain integrals of the type 
io fe FOO (4.3.10) 
0 


to be computed. The method of evaluation at low temperatures was given by 
Sommerfeld; J can be decomposed in the following manner: 


I= fe Hevu fe fle)[n(e) — Ou —6)] 
- ee (4.3.11) 
S fe fle) + i de f(c)[n(e) — O(u—e)] 
0 


—co 


and for T — 0, the limit of integration in the second term can be extended to 
—oo to a good approximation, since for negative e, n(e) = 1+O(e~(#—-2)/#T) 6 
One can see immediately from Fig. 4.4 that (n(e)—O@(p—e)) differs from zero 
only in the neighborhood of ¢ = 1 and is antisymmetric around pu.” Therefore, 
° Tf f(e) is in principle defined only for positive ¢, one can e.g. define f(—e) = f(e); 
the result depends on f(€) only for positive e. 
7 1 _ 9@(-z) =1- —_ - @(-2) = 1 e(x)] . 


ec +1 e-®4+1 e-*4+1 
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we expand f(e) around the value y in a Taylor series and introduce a new 
integration variable, x = (e — yp) /kT: 


- faery f aft oc] 


(Fw) (Kr) 2+ EY (arya? +...) 


= [ie rey+2nyr » fae art 


kT x 
I my fae 
0 


- e(-2)| is antisymmetric and = 


aT =o for « > 0). From this, 
the general expansion in terms of the temperature follows, making use of the 


integrals computed in Appendix D., Eq. (D.7) ® 


(since 


T1* 


I= fe fle)+ (kT) "(un )+ 30 KE) ia ee (4.3.12) 
0 


Applying this expansion to Eq. (4.3.9), we find 


(oF) +009} 


This equation can be solved iteratively for 4, yielding the chemical potential 
as a function of T and N/V: 


waerft-(Z) sor}, (4.3.13) 


where €p is given by (4.3.3). The chemical potential decreases with increasing 
temperature, since then no longer all the states within the Fermi sphere are 
occupied. In a similar way, we find for (4.3.8) 


_3/2f 2 ; 3 
# = -Negt| 28? * (AP)? Sy? oy (4.3.14) 


8 This series is an asymptotic expansion in T’. An asymptotic series for a function 
(A), TA) = rp aeA* + Rm(A), is characterized by the following behavior of 
the remainder: lim,—.90 Rm(A)/A™ = 0, limm—oo Rm(A) = co. For small values 
of A, the function can be represented very accurately by a finite number of terms 
in the series. The fact that the integral in (4.3.10) for functions f(e) ~ e1/? 
etc. cannot be expanded in a Taylor series can be immediately recognized, since 
I diverges for T < 0. 
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from which, inserting (4.3.13),° 


2 br? (kT\? 
@ = —=Neps1+ — ( — qe 4.3.14! 
: cry +5 (=) +0( )} (4.3.14) 


or using P = —®/V, we obtain the equation of state. From (4.1.21), we find 
immediately the internal energy 


3 3 5a? (kT \? i 
a a sNer{1 tans (=) + O(T )} ; (4.3.15) 


From this, we calculate the heat capacity at constant V and N: 


wT 
Cy = Nk —~— 4.3.16 
4 2 ras ’ ( ) 


where we have introduced the Fermi temperature 
Tr =epr/k . (4.3.17) 


At low temperatures, (T < Tr), the heat capacity is a linear function of the 
temperature (Fig. 4.5). This behavior can be qualitatively understood in a 
simple way: if one increases the temperature from zero to T, the energy of a 
portion of the particles increases by kT. The number of particles which are 
excited in this manner is limited to a shell of thickness kT around the Fermi 
sphere, i.e. it is given by NkT/ep. All together, the energy increase is 


kT 
5E~kTN—, (4.3.16’) 
EF 
from which, as in (4.3.16), we obtain Cy ~ kNT/Tp. According to (4.3.14’), 
the pressure is given by 


2 (6n2\79 h2ON\PR Tn? (KD\? 
P= 14+ — | — met [ss 4.3.14” 
Ma) oly) shan (a) le - aoe 
Due to the Pauli exclusion principle, there is a pressure increase at T = 0 


relative to a classical ideal gas, as can be seen in Fig. 4.6. The isothermal 
compressibility is then 


ee -4 ay = ee f is (os +] (4.3.18) 


° If one requires the grand potential as a function of its natural variables, it is nec- 


essary to substitute Wee = Vg(2m)?/? /6r7h? in (4.3.14). For the calculation 
of Cy and the equation of state, it is however expedient to employ T,V, and N 
as variables. 
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T T 
Fig. 4.5. The specific heat (heat ca- Fig. 4.6. The pressure as a function 
pacity) of the ideal Fermi gas of the temperature for the ideal Fermi 


gas (solid curve) and the ideal classical 
gas (dashed) 


For the entropy, we find for T < Tr 


nT 
=kN—~— — 4.3.1 
S=k 2 Tr (4.3.19) 


with TS = E+ PV — uN from (4.3.15), (4.3.14’) and (4.3.13) (cf. Ap- 
pendix A.1, ‘Third Law’). 

The chemical potential of an ideal Fermi gas with a fixed density can 
be found from Eq. (4.3.9) and is shown in Fig. 4.7 as a function of the 
temperature. 


Fig. 4.7. The chemical potential of 
the ideal Fermi gas at fixed density 
as a function of the temperature. 


Addenda: 
(i) The Fermi temperature, also known as the degeneracy temperature, 
2/3 
EF —38 1 N 
Tr|K] = — =3. 1 — | ——_ 4.3.2 
F [K] i 3.85 x 10 rae (73) (4.3.20) 


characterizes the thermodynamic behavior of fermions (see Table 4.1). For T < Tr, 
the system is nearly degenerate, while for T >> Ty, the classical limit applies. 
Fermi energies are usually quoted in electron volts (eV). Conversion to Kelvins is 
accomplished using 1eV411605 K . 
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(ii) The density of states is defined as 


v(e) = (mh fer d(€—€p). (4.3.21) 


We note that v(<) is determined merely by the dispersion relation and not by statis- 
tics. The thermodynamic quantities do not depend on the details of the momentum 
dependence of the energy levels, but only on their distribution, i.e. on the density 
of states. Integrals over momentum space, whose integrands depend only on €p, can 
be rearranged as follows: 


3 
[@p ter) = fae fap fs -<») = or [ave 


For example, the particle number can be expressed in terms of the density of states 
in the form 


oe) 


N= yh de v(e)n(e) . (4.3.22) 


—oo 


For free electrons, we find from (4.3.21) 


3 
— gV (AM\? 42 _ 3 ell? 
F 


The dependence on e'/? shown in Fig. 4.8 is characteristic of nonrelativistic, non- 
interacting material particles. 


Vv (E) 


V(e)~ ev? 


Fig. 4.8. The density of states for 
free electrons in three dimensions 


The derivations of the specific heat and the compressibility given above can be 
generalized to the case of arbitrary densities of states v(e) by evaluating (4.3.9) 
and (4.3.8) in terms of a general v(e). The results are 


Cy = stun) + O((T/Tr)*) (4.3.24a) 


ie ‘uler) + O((T/Tr)?) . (4.3.24b) 
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The fact that only the value of the density of states at the Fermi energy is of 
importance for the low-temperature behavior of the system was to be expected after 
the discussion following equation (4.3.17). For (4.3.23), we find from (4.3.24a,b) once 
again the results (4.3.16) and (4.3.18). 

(iii) Degenerate Fermi liquids: physical examples of degenerate Fermi liquids are 
listed in Table 4.1. 


Table 4.1. Degenerate Fermi liquids: mass, density, Fermi temperature, Fermi 
energy 


Particles mlg] N/V [cm~*| Tr[K] erleV] 
Metal 0.91 x 10-27 1024 10° < 10 
electrons 
3 _ —24 
ce oe aig 5 (16-28)x10% L714 (1.5-0.9) x 1074 
Neutrons 0.11 x 10° 5.3x 101! 2 
in the Lor x10. x (452) a-z\z 46(454)5 x 10° 
Nucleus A x ( A ) 
Protons ; 3 3 
in the 167x107 (0.11 x 10°42. 5:3 x 10" (4)? 46(4)% & 10° 
Nucleus 
Electrons 
. ae 0.91 x 10-27 10°° 3 x 10° 3 x 10° 

Wi 
Stars 


(iv) Coulomb interaction: electrons in metals are not free, but rather they 
repel each other as a result of their Coulomb interactions 


2 ie (4.3.25) 


The following scaling of the Hamiltonian shows that the approximation of free 
electrons is particularly reasonable for high densities. To see this, we carry 
out the canonical transformation r’ = r/ro, p'’ = pro. The characteristic 
length ro is defined by 42 73N =V,ie. ro = (aye. In terms of these new 
variables, the Hamiltonian is 


= ey Se (4.3.25') 
ge ~~ 2m 85 rf ae 


The Coulomb interaction becomes less and less important relative to the 
kinetic energy the smaller ro, i.e. the more dense the gas becomes. 
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*4.3.3 Real Fermions 


In this section, we will consider real fermionic many-body systems: the con- 
duction electrons in metals, liquid 73He, protons and neutrons in atomic nuclei, 
electrons in white dwarf stars, neutrons in neutron stars. All of these fermions 
interact; however, one can understand many of their properties without tak- 
ing their interactions into account. In the following, we will deal with the 
parameters mass, Fermi energy. and temperature and discuss the modifica- 
tions which must be made as a result of the interactions (see also Table 4.1). 


a) The Electron Gas in Solids 


The alkali metals Li, Na, K, Rb, and Cs are monovalent (with a body-centered 
cubic crystal structure); e.g. Na has a single 3s! electron (Table 4.2). The 
noble metals (face-centered cubic crystal structure) are 


Copper Cu 4s!3d1° 
Silver Ag 5s14d19 
Gold Au 6s'5d!° . 


All of these elements have one valence electron per atom, which becomes a 

conduction electron in the metal. The number of these quasi-free electrons is 

equal to the number of atoms. The energy-momentum relation is to a good 
Pp? 10 


approximation parabolic, ep = $—. 


Table 4.2. Electrons in Metals; Element, Density, Fermi Energy, Fermi Tempera- 
ture, 7/theor., Effective Mass 


N/V [cem~3] er [eV] Tr [K] Y/Ytheor. ™m*/m 
Li 4.6 x 10” 4.7 5.5x 10* 2.17 2.3 
Na 2.5 3.1 3.7 1.21 1.3 
K 1.34 2.1 2.4 1.23 1.2 
Rb 1.08 1.8 2.1 1.22 1.3 
Cs 0.86 1.5 1.8 1.35 1.5 
Cu 8.5 7 8.2 1.39 1.3 
Ag 5.76 5.5 6.4 1.00 1.1 
Au 5.9 5.5 6.4 1.13 TA 
10 Remark concerning solid-state physics applications: for Na, we have a (PE)? = 
an a 3 Varill., where Vprin. is the volume of the first Brillouin zone. The Fermi 


sphere always lies within the Brillouin zone and thus never crosses the zone 
boundary, where there are energy gaps and deformations of the Fermi surface. 
The Fermi surface is therefore in practice spherical, Apr/pr ~ 10-°. Even in 
copper, where the 4s Fermi surface intersects the Brillouin zone of the fcc lattice, 
the Fermi surface remains in most regions spherical to a good approximation. 
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(Specific Heat/ Temperature) wcal/ Kg at, 


0.5 1.0 15 2.0 2.5 
(Temperature K)® 


Fig. 4.9. The experimental determination of y from the specific heat of gold 
(D.L. Martin, Phys. Rev. 141, 576 (1966); ibid. 170, 650 (1968)) 


Taking account of the electron-electron interactions requires many-body 
methods, which are not at our disposal here. The interaction of two electrons 
is weakened by screening from the other electrons; in this sense, it is un- 
derstandable that the interactions can be neglected to a first approximation 
in treating many phenomena (e.g. Pauli paramagnetism; but not ferromag- 
netism). 

The total specific heat of a metal is composed of a contribution from the 
electrons (Fig. 4.9) and from the phonons (lattice vibrations, see Sect 4.6): 


Cy 
— = 7T+DT?. 
ee 


Plotting 4 = y+ DT? vs. T?, we can read ¥ off the ordinate. From (4.3.16), 
the theoretical value of 7 is Ytheor = re The deviations between theory 
and experiment can be attributed to the fact that the electrons move in the 
potential of the ions in the crystal and are subject to the influence of the 
electron-electron interaction. The potential and the electron-electron inter- 
action lead among other things to an effective mass m* for the electrons, 
i.e. the dispersion relation is approximately given by €p = pas This effective 
mass can be larger or smaller than the mass of free electrons. 


b) The Fermi Liquid 7He 


3He has a nuclear spin of J = 4, a mass m = 5.01 x 10~*4g, a particle density 
of n = 1.6 x 10?2cm~3 at P = 0, and a mass density of 0.081gcm~?. It 
follows that er = 4.2 x 10~4eV and Tp = 4.9K. The interactions of the ?He 
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Fig. 4.10. The phase 
diagram of *He 


atoms lead to an effective mass which at the pressures P = 0 and P = 30 
bar is given by m* = 2.8m and m* = 5.5m. Hence the Fermi temperature 
for P = 30, Tr = 1K, is reduced relative to a fictitious non-interacting ?He 
gas. The particle densities at these pressures are n = 1.6 x 1072cm7° and 
n = 2.3 x 10?cm7?. The interaction between the helium atoms is short- 
ranged, in contrast to the electron-electron interaction. The small mass of 
the helium atoms leads to large zero-point oscillations; for this reason, ?He, 
like “He, remains a liquid at pressures below ~ 30 bar, even at T — 0. ?He 
and *He are termed quantum liquids. At 107? K, a phase transition into the 
superfluid state takes place (1 = 1, s = 1) with formation of BCS pairs.'? 
In the superconductivity of metals, the Cooper pairs formed by the electrons 
have 1 =0 and s =0. The relatively complex phase diagram of ?He is shown 
in Fig. 4.10.4 


c) Nuclear Matter 


A further example of many-body systems containing fermions are the neu- 
trons and protons in the nucleus, which both have masses of about m = 
1.67 x 10~-*4g. The nuclear radius depends on the nucleon number A via 
R = 1.3 x 1078A'/3cm. The nuclear volume is V = R* = 44(1.3)% x 
10-3°Acm? = 9.2 x 107°9Acm?. A is the overall number of nucleons and 
Z the number of protons in the nucleus. Nuclear matter!? occurs not only 
within large atomic nuclei, but also in neutron stars, where however also the 
gravitational interactions must be taken into account. 


" D. Vollhardt and P. Wolfle, The Superfluid Phases of Helium 3, Taylor & Francis, 
London, 1990 

12 A.L. Fetter and J.D. Walecka, Quantum Theory of Many-Particle Systems, 
McGraw-Hill, New York 1971 
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d) White Dwarfs 


The properties of the (nearly) free electron gas are indeed of fundamental 
importance for the stability of the white dwarfs which can occur at the final 
stages of stellar evolution.'® The first such white dwarf to be identified, Sir- 
ius B, was predicted by Bessel as a companion of Sirius. 

Mass + Mo = 1.99 x 10*8¢ 

Radius 0.01R5, Ro = 7 x 10!°cm 

Density © 10’po = 10%g/em?, po = 1g/cm? 

PSirius B © 0.69 x 10°g/cm? 

Central temperature © 107K = Tp 

White dwarfs consist of ionized nuclei and free electrons. Helium can still be 
burned in white dwarfs. The Fermi temperature is Tr ~ 3-10°K, so that the 
electron gas is highly degenerate. The high zero-point pressure of the electron 
gas opposes the gravitational attraction of the nuclei which compresses the 
star. The electrons can in fact be regarded as free; their Coulomb repulsion 
is negligible at these high pressures. 


*e) The Landau Theory of Fermi Liquids 


The characteristic temperature dependences found for ideal Fermi gases at 
low temperatures remain in effect in the presence of interactions. This is the 
result of Landau’s Fermi liquid theory, which is based on physical arguments 
that can also be justified in terms of microscopic quantum-mechanical many- 
body theory. We give only a sketch of this theory, including its essential 
results, and refer the reader to more detailed literature!*. One first considers 


13 An often-used classification of the stars in astronomy is based on their positions in 
the Hertzsprung—Russell diagram, in which their magnitudes are plotted against 
their colors (equivalent to their surface temperatures). Most stars lie on the so 
called main sequence. These stars have masses ranging from about one tenth of 
the Sun’s mass up to a sixty-fold solar mass in the evolutionary stages in which 
hydrogen is converted to helium by nuclear fusion (‘burning’). During about 90% 
of their evolution, the stars stay on the main sequence — as long as nuclear fusion 
and gravitational attraction are in balance. When the fusion processes come to 
an end as their ‘fuel’ is exhausted, gravitational forces become predominant. In 
their further evolution, the stars become red giants and finally contract to one of 
the following end stages: in stars with less than 1.4 solar masses, the compression 
process is brought to a halt by the increase of the Fermi energy of the electrons, 
and a white dwarf is formed, consisting mainly of helium and electrons. Stars 
with two- or threefold solar masses end their contraction after passing through 
intermediate phases as neutron stars. Above three or four solar masses, the Fermi 
energy of the neutrons is no longer able to stop the compression process, and a 
black hole results. 

A detailed description of Landau’s Fermi liquid theory can be found in D. Pines 
and P. Noziéres, The Theory of Quantum Liquids, W. A. Benjamin, New York 
1966, as well as in J. Wilks, The Properties of Liquid and Solid Helium, Clarendon 
Press, Oxford, 1967. See also J. Wilks and D.S. Betts, An Introduction to Liquid 
Helium, Oxford University Press, 2" ed., Oxford, (1987). 
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the ground state of the ideal Fermi gas, and the ground state with an addi- 
tional particle (of momentum p); then the interaction is ‘switched on’. The 
ideal ground state becomes a modified ground state and the state with the 
additional particle becomes the modified ground state plus an excited quan- 
tum (a quasiparticle of momentum p). The energy of the quantum, ¢(p), is 
shifted relative to ¢9(p) = p”/2m. Since every non-interacting single-particle 
state is only singly occupied, there are also no multiply-occupied quasiparticle 
states; i.e. the quasiparticles also obey Fermi—Dirac statistics. 

When several quasiparticles are excited, their energy also depends upon 
the number dn(p) of the other excitations 


e(p) = co(p) + 90 F(p,p')én(p’) . (4.3.26) 


The average occupation number takes a similar form to that of ideal fermions, 
owing to the fermionic character of the quasiparticles: 


1 


~ GE)=H/kT 4” (4.3.27) 


Np 
where, according to (4.3.26), e(p) itself depends on the occupation number. 
This relation is usually derived in the present context by maximizing the 
entropy expression found in problem 4.2, which can be obtained from purely 
combinatorial considerations. At low temperatures, the quasiparticles are ex- 
cited only near the Fermi energy, and due to the occupied states and energy 
conservation, the phase space for scattering processes is severely limited. Al- 
though the interactions are by no means necessarily weak, the scattering rate 
vanishes with temperature as 4 ~ T?, ie. the quasiparticles are practically 
stable particles. 


The interaction between the quasiparticles can be written in the form 
F(p,o;p',o’) = f*(p,p’) +a - 0’ f"(p,p’) (4.3.28a) 


with the Pauli spin matrices o. Since only momenta in the neighborhood of 
the Fermi momentum contribute, we introduce 


f*(p,p’) = f(x) (4.3.28b) 


and 
EMO) Hert AO). = AUTRE f(y) (4.3.28) 


where y is the angle between p and p’ and v(epr) is the density of states. A 
series expansion in terms of Legendre polynomials leads to 


ao ek ys F/*P,(cosx) = 1+ FP“ cosy +... . (4.3.28d) 
i 


The F and fF" are the spin-symmetric and spin-antisymmetric Landau pa- 
rameters; the F;* result from the exchange interaction. 
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Due to the Fermi character of the quasiparticles, which at low temperatures 
can be excited only near the Fermi energy, it is clear from the qualitative 
estimate (4.3.16’) that the specific heat of the Fermi liquid will also have a 
linear temperature dependence. In detail, one obtains for the specific heat, 
the compressibility, and the magnetic susceptibility: 


1 
Cy= 3” Yer) al (4.3.29a) 
V v(er) 
ee 4.3.29b 
“TN? 1+ Fe’ ( ) 
2 U(ER)N 
= La ———— 4.3.2 
X=HBT Re? (4.3.29c) 
with the density of states v(er) = Vin pe and the effective mass ratio 
m* 1 
=1+=F, (4.3.29d) 
m 3 


The structure of the results is the same as for ideal fermions. 


4.4 The Bose—Einstein Condensation 


In this section, we investigate the low-temperature behavior of a nonrelativis- 
tic ideal Bose gas of spin s = 0, i.e. g = 1 and 
p 

a oe (4.4.1) 
In their ground state, non-interacting bosons all occupy the energetically 
lowest single-particle state; their low-temperature behavior is therefore quite 
different from that of fermions. Between the high-temperature phase, where 
the bosons are distributed over the whole spectrum of momentum values, 
corresponding to the Bose distribution function, and the phase in which the 
(p = 0) state is macroscopically occupied (at T = 0, all the particles are 
in this state), a phase transition takes place. This so called Bose-Einstein 
condensation of an ideal Bose gas was predicted by Einstein!® on the basis 
of the statistical considerations of Bose, nearly seventy years before it was 
observed experimentally. 

We first refer to the results of Sect 4.1, where we found for the particle 
density, i.e. for the reciprocal of the specific volume, in Eq. (4.1.17): 


Xe 


>a 93/2(2) (4.4.2a) 


' A. Einstein, Sitzber. Kgl. Preuss. Akad. Wiss. 1924, 261, (1924), ibid. 1925, 3 
(1925): S. Bose, Z. Phys. 26, 178 (1924) 
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with \ = fh,/27/mkT and, using (4.2.1), 


2 vs gle Soa gh 
93/2(2) = sz fe engl 1 = », [3/2 : (4.4.2b) 
0 = 


According to Remark (i) in Sect. 4.1, the fugacity of bosons z = e“/*™ is 
limited to z < 1. The maximum value of the function g3/2(z), which is shown 
in Fig. 4.11, is then given by g3/2(1) = ¢(3/2) = 2.612. 


6619 scomonates 


1 # 1/2.612 v/r3 


Fig. 4.11. The function g3/2(z). Fig. 4.12. The fugacity z as a func- 
tion of v/d® 


In the following, we take the particle number and the volume, and thus the 
specific volume v, to be fixed at given values. Then from Eq. (4.4.2a), we can 
calculate z as a function of T, or, more expediently, of v\~3. On lowering the 
temperature, 34 decreases and z therefore increases, until finally at $6 = 345 
it reaches its maximum value z = 1 (Fig. 4.12). This defines a characteristic 
temperature 

Qrh?/m 
(2.612 v)2/3 ° 
When z approaches 1, we must be more careful in taking the limit of So => 
J dp used in (4.1.14a) and (4.1.15). This is also indicated by the fact that 
(4.4.2a) would imply for z = 1 that at temperatures below T.(v), the density 
+ must decrease with decreasing temperature. From (4.4.2a), there would 
appear to no longer be enough space for all the particles. Clearly, we have 
to treat the (p = 0) term in the sum in (4.1.8), which diverges for z — 1, 
separately: 


kT.(v) = (4.4.3) 


1 1 V 
rie es ms BED) zi-1 | (27h) | @pn , 

p40 
The p = 0 state for fermions did not require any special treatment, since the 
average occupation numbers can have at most the value 1. Even for bosons, 
this modification is important only at T < T,(v) and leads at T = 0 to the 
complete occupation of the p = 0 state, in agreement with the ground state 
which we described above. 


N= 
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We thus obtain for bosons, instead of (4.4.2a): 
1 v 
Ni Sabai + N 5393/2(2) ; (4.4.4) 


or, using Eq. (4.4.3), 


ie a 93/2(2) : 
;) (4.4.4") 


1 
N= t+ N F 
ete) ee 93/2(1) 


The overall particle number N is thus the sum of the number of particles in 
the ground state 


No = (4.4.5a) 
and the numbers in the excited states 


2 pA 93/2(2) 
a -"(zo) nas eee) 


For T > T,(v), Eq. (4.4.4) yields a value for z of z < 1. The first term on 
the right-hand side of (4.4.4’) is therefore finite and can be neglected relative 
to N. Our initial considerations thus hold here; in particular, z follows from 


T.(v) 


3/2 
93/2(z) = 2.012( ) for T > T.(v) . (4.4.5c) 


For T < T.(v), from Eq. (4.4.4’), z = 1— O(1/N), so that all of the particles 
which are no longer in excited states can find sufficient ‘space’ to enter the 
ground state. When z is so close to 1, we can set z = 1 in the second term 
and obtain 


No=N ( = (am) | 


Defining the condensate fraction in the thermodynamic limit by 


Ni 
vo = lim WV? (4.4.6) 


we find in summary 


0 T > T.(v) 
Yo = 


ies Gy P27 (e): 


(4.4.7) 


This phenomenon is called the Bose-Einstein condensation. Below T,.(v), the 
ground state p = 0 is macroscopically occupied. The temperature depen- 
dence of vg and ,/v is shown in Fig. 4.13. The quantities vg and ,/v are 
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fi 
1 
VV 
Te(v) ~ v7 7/8 
Te T Uv 

Fig. 4.13. The relative number of par- Fig. 4.14. The transition tempera- 
ticles in the condensate and its square ture as a function of the specific vol- 
root as functions of the temperature ume 


characteristic of the condensation or the ordering of the system. For reasons 
which will become clear later, one refers to ,\/Yp as the order parameter. In 
the neighborhood of Ty, ,/v goes to zero as 


Vien Tea (4.4.7') 


In Fig. 4.14, we show the transition temperature as a function of the specific 
volume. The higher the density (i.e. the smaller the specific volume), the 
higher the transition temperature T.(v) at which the Bose-Einstein conden- 
sation takes place. 

Remark: One might ask whether the next higher terms in the sum >7,, n(ep) could 


not also be macroscopically occupied. The following estimate however shows that 
n(ép) < n(0) for p 4 0. Consider e.g. the momentum p = (27, 0,0), for which 


a 1 z 1 1 2 2m 
V ¢fpi/2mz-1_ 4 V efpi/2m _ 1 V Bpt 


~ Ov) 


holds, while $—+— ~ O(1) . 


zt 1 

There is no change in the grand potential compared to the integral represen- 
tation (4.1.19’), since for the term with p = 0 in the thermodynamic limit, 
it follows that 

1 — 
am 
Therefore, the pressure is given by (4.1.19’) as before, where z for T > T.(v) 
follows from (4.4.5c), and for T < T,(v) it is given by z = 1. Thus finally the 
pressure of the ideal Bose gas is 


1 1 
lim V log(1 — z(V)) = lim — log 0. 


V-0o V0 V 


kT 
cel 95/2(2) T>T, 

P= (4.4.8) 
ual 1.342 T<T 


a 
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93/2(2) 


Fig. 4.15. The functions g3/2(z) and 
95/2(z). In the limit z — 0, the functions 
become asymptotically identical, 93/2(z) © 
1 z 95/2 (z) RZ. 


isochores 


T< ~~ 


Pa KT1 342 
~T 52 


+— Classical behavior 
for large v and T 


Fig. 4.16. The equation of 
state of the ideal Bose gas. 
The isochores are shown for 


Uy > U2 decreasing values of v. For T < 
T.(v), the pressure is P = 
iy KP 1.342. 


with gs/2(1) = ¢ (3) = 1.342. If we insert z from (4.4.4) here, we obtain the 
equation of state. For T’ > T,, using (4.4.5c), we can write (4.4.8) in the form 
_ AT 95/2(2) 


P=— mo (4.4.9) 


The functions g5/2(z) and g3/2(z) are drawn in Fig. 4.15. The shape of the 
equation of state can be qualitatively seen from them. For small values of 
2, 95/2(2) © g3/2(z), so that for large v and high T, we obtain again from 
(4.4.9) the classical equation of state (see Fig. 4.16). On approaching T.(v), 
it becomes increasingly noticeable that g5/2(z) < 93/2(z). At T.(v), the iso- 
chores converge into the curve P = ee 1.342, which represents the pressure 
for T < T,(v). All together, this leads to the equation of state corresponding 
to the tsochores in Fig. 4.16. 

For the entropy, we find!® 


Nk Cee T>T 
OPV nae 
Vip 5 95/2(1) (zy 
k= = T<T, 
2 gsja(1) \ Te 


*® Note that 49.(z) = +gv-1(2). 
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Sans a a as ee ee 


Fig. 4.17. The heat capacity = Nx 
the specific heat of an ideal Bose gas 


T, T 


UT, (v) 
Oo 
v 


4 T/T, 


3 Fig. 4.18. The chemical potential of 
the ideal Bose gas at a fixed density 
as a function of the temperature 


and, after some calculation, we obtain for the heat capacity at constant vol- 
ume 


15 v 9 93/2(2) 
z T>T, 
as 4 3 95/24 ) 4 91 /2(2) 
Coat (=) = Nk . (4.4.11) 
ae 15 95/2(1) Ge T 
— — SAT. 
4 gsjo(1) \ Te 


The entropy and the specific heat vary as T?/? at low T. Only the excited 
states contribute to the entropy and the internal energy; the entropy of the 
condensate is zero. At T = T,, the specific heat of the ideal Bose gas has a 
cusp (Fig. 4.17). 

From Eq. (4.4.4) or from Fig. 4.12, one can obtain the chemical potential, 
shown in Fig. 4.18 as a function of the temperature. 

At T) = 2.18K, the so called lambda point, +He exhibits a phase transi- 
tion into the superfluid state (see Fig. 4.19). If we could neglect the interac- 
tions of the helium atoms, the temperature of a Bose-Einstein condensation 
would be T.(v) = 3.14K, using the specific volume of helium in (4.4.3). The 
interactions are however very important, and it would be incorrect to iden- 
tify the phase transition into the superfluid state with the Bose-Einstein 
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P 
solid 
25 
bar 
Hel 
ve | gas 
2.18K 5.19K T 

Fig. 4.19. The phase diagram of *He Fig. 4.20. The experimental specific 
(schematic). Below 2.18 K, a phase heat of “He, showing the characteristic 
transition from the normal liquid He I lambda anomaly 


phase into the superfluid He II phase 
takes place 


condensation treated above. The superfluid state in three-dimensional he- 
lium is indeed also created by a condensation (macroscopic occupation) of 
the p = 0 state, but at T = 0, the fraction of condensate is only 8%. The 
specific heat (Fig. 4.20) exhibits a A anomaly (which gives the transition its 
name), i.e. an approximately logarithmic singularity. The typical excitation 
spectrum and the hydrodynamic behavior as described by the two-fluid model 
are compatible only with an interacting Bose system (Sect. 4.7.1). 

Another Bose gas, which is more ideal than helium and in which one can 
likewise expect a Bose-Einstein condensation — which has been extensively 
searched for experimentally — is atomic hydrogen in a strong magnetic field 
(the spin polarization of the hydrogen electrons prevents recombination to 
molecular Hz). Because of the difficulty of suppressing recombination of H 
to Hz, over a period of many years it however proved impossible to prepare 
atomic hydrogen at a sufficient density. The development of atom traps has 
recently permitted remarkable progress in this area. 

The Bose-Einstein condensation was first observed, 70 years after its orig- 
inal prediction, in a gas consisting of around 2000 spin-polarized ®’Rb atoms, 
which were enclosed in a quadrupole trap.'”’!® The transition temperature is 
at 170 x 10-°K. One might at first raise the objection that at low tempera- 
tures the alkali atoms should form a solid; however, a metastable gaseous state 
can be maintained within the trap even at temperatures in the nanokelvin 
range. In the initial experiments, the condensed state could be kept for about 
ten seconds. Similar results were obtained with a gas consisting of 2 x 10° 


17 M.H. Anderson, J. R. Ensher, M. R. Matthews, C. E. Wieman, and E. A. Cornell, 
Science 269, 198 (1995) 
'8 See also G. P. Collins, Physics Today, August 1995, 17. 
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spin-polarized “Li atoms.'® In this case, the condensation temperature is 
T. ~ 400 x 10-°K. In ®’Rb, the s-wave scattering length is positive, while in 
“Li, it is negative. However, even in “Li, the gas phase does not collapse into 
a condensed phase, in any case not within the spatially inhomogeneous atom 
trap.!°Finally, it also proved possible to produce and maintain a conden- 
sate containing more than 10° atoms of atomic hydrogen, with a transition 
temperature of about 50 uK, for up to 5 seconds.?° 


4.5 The Photon Gas 


4.5.1 Properties of Photons 


We next want to determine the thermal properties of the radiation field. To 
start with, we list some of the characteristic properties of photons. 


(i) Photons obey the dispersion relation ¢, = c|p| = fick and are bosons 
with a spin s = 1. Since they are completely relativistic particles (m = 0, v = 
c), their spins have only two possible orientations, i.e. parallel or antiparallel 
to p, corresponding to right-hand or left-hand circularly polarized light (0 and 
m are the only angles which are Lorentz invariant). The degeneracy factor for 
photons is therefore g = 2. 

(ii) The mutual interactions of photons are practically zero, as one can 
see from the following argument: to lowest order, the interaction consists 
of the scattering of two photons 7, and yz into the final states y3 and 
ya; see Fig. 4.21a. In this process, for example photon y, decays into a 
virtual electron-positron pair, photon 72 is absorbed by the positron, the 
electron emits photon y3 and recombines with the positron to give photon 
ya. The scattering cross-section for this process is extremely small, of order 
ao = 10~-°° cm?. The mean collision time can be calculated from the scattering 
cross-section as follows: in the time At, a photon traverses the distance cAt. 
We thus consider the cylinder shown in Fig. 4.21b, whose basal area is equal 
to the scattering cross-section and whose length is the velocity of light x At. 
A photon interacts within the time At with all other photons which are in 
the volume co At, roughly speaking. Let N be the total number of photons 
within the volume V (which depends on the temperature and which we still 
have to determine; see the end of Sect. 4.5.4). Then a photon interacts with 
co N/V particles per unit time. Thus the mean collision time (time between 
two collisions on average) 7 is determined by 


ce (V/N) ay 1910 ee V 


co cm’ N 


19 C.C. Bradley, C. A. Sackett, J. J. Tollett, and R.G. Hulet, Phys. Rev. Lett. 75, 
1687 (1995) 
20 1. Kleppner, Th. Greytak et al., Phys. Rev. Lett. 81, 3811 (1998) 
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cAt 


Fig. 4.21. (a) Photon-photon scattering (dashed lines: photons; solid lines: electron 
and positron). (b) Scattering cross-section and mean collision time 


The value of the mean collision time is approximately tr ~ 10%! sec at room 
temperature and t © 101% sec at the temperature of the Sun’s interior (10’K). 
Even at the temperature in the center of the Sun, the interaction of the pho- 
tons it negligible. In comparison, the age of the Universe is ~ 10!’sec. Photons 
do indeed constitute an ideal quantum gas. 

The interaction with the surrounding matter is crucial in order to establish 
equilibrium within the radiation field. The establishment of equilibrium in 
the photon gas is brought about by absorption and emission of photons by 
matter. In the following, we will investigate the radiation field within a cavity 
of volume V and temperature 7’, and without loss of generality of our consid- 
erations, we take the quantization volume to be cubical in shape (the shape 
is irrelevant for short wavelengths, and the long waves have a low statistical 
weight). 

(iii) The number of photons is not conserved. Photons are emitted and ab- 
sorbed by the material of the cavity walls. From the quantum-field descrip- 
tion of photons it follows that each wavenumber and polarization direction 
corresponds to a harmonic oscillator. The Hamiltonian thus has the form 


i= do epp, = So epal, yap,a ; p#0, (4.5.1) 
p,A pr 


fee ne i , 
where Np,, = 4, \4, , is the occupation number operator for the momen- 


tum p and the direction of polarization .; also, a, y? Gp, are the creation 
and annihilation operators for a photon in the state p, A. We note that in 
the Hamiltonian of the radiation field, there is no zero-point energy, which is 
automatically accomplished in quantum field theory by defining the Hamil- 


tonian in terms of normal-ordered products.?! 


2! CG. Itzykson, J.-B. Zuber, Quantum Field Theory, McGraw-Hill; see also QM II. 
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4.5.2 The Canonical Partition Function 


The canonical partition function is given by (np,, = 0,1,2,...): 
2 


1 
= ,—-BH _ r>—B >, Epp, r — 
Za tie FS ye P eyertaS ees ; (4.5.2) 
{np,v} p40 
Here, there is no condition on the number of photons, since it is not fixed. 
In (4.5.2), the power 2 enters due to the two possible polarizations A. With 
this expression, we find for the free energy 


F(T,V) = —kT log Z = 2kT \~ log (1 2, ad 
p40 


The sum has been converted to an integral according to (4.1.14a). For the 
integral in (4.5.3), we find after integration by parts 


Co 


1 dx x? 1 T 
2 —2“ — pany 
[oe log(l —e7*) = |< a 2 a 2¢(4) 5 
0 0 am 


where ¢(n) is Riemann’s ¢-function (Eqs. (D.2) and (D.3)), so that for F’, we 
have finally 


V(kT)* rr? 40. 4 
F(T = = T 4.5.4 
oe. (fic)? 45 3c he ae 


with the Stefan—Boltzmann constant 


an k4 
° = 60h 


= 5.67 X10? Jsec-*m-* K™., (4.5.5) 


From (4.5.4), we obtain the entropy: 


OF 16a 
pes eee a eps 4.5. 
S (=) ‘ ae VT”, (4.5.6a) 


the internal energy (caloric equation of state) 
E=F+TS =—VvrT", (4.5.6b) 
c 


and the pressure (thermal equation of state) 
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OF 4o 
Pre |e) Sg 5. 
ean 3o1 (4.5.6c) 


and finally the heat capacity 
= Vr (4.5.7) 


Because of the relativistic dispersion, for photons 
E=3PV 


holds instead of PV. Eq. (4.5.6b) is called the Stefan—Boltzmann law: the 
internal energy of the radiation field increases as the fourth power of the 
temperature. The radiation pressure (4.5.6c) is very low, except at extremely 
high temperatures. At 10°K, the temperature produced by the a nuclear 
explosion, it is P = 0.25 bar, and at 10’ K, the Sun’s central temperature, it 
is P= 25X10" bar: 


4.5.3 Planck’s Radiation Law 


We now wish to discuss some of the characteristics of the radiation field. The 
average occupation number of the state (p, A) is given by 


1 
(p,r) = op/kT 4] (4.5.8a) 
with ¢p = hwp = cp, since 


oo 
> n yen hp.sep/kT 
—BHy Pp, 
( ) Tre Tp,A Np, r=0 
Nprv) = = 
= Tre 8H 


oo 
> eT p, Ep /kT. 


Np,»=0 


can be evaluated analogously to Eq. (4.1.9). The average occupation number 
(4.5.8a) corresponds to that of atomic or molecular free bosons, Eq. (4.1.9), 
with w= 0. 

The number of occupied states in a differential element d°p within a fixed 
volume is therefore (see (4.1.14a)): 


2V 


(np,n) Gane dp, (4.5.8b) 


and in the interval [p, p + dp], it is 


7 
(Np,r) 273 p dp. (4.5.8c) 
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It follows from this that the number of occupied states in the interval [w, w+ 
dw] is equal to 


V wdw 
723 eliw/kT _ 7 . (4.5.8d) 
The spectral energy density u(w) is defined as the energy per unit volume 
and frequency, i.e. as the product of (4.5.8d) with iw/V: 


h w 
u(w) = a yer (4.5.9) 
This is the famous Planck radiation law (1900), which initiated the develop- 
ment of quantum mechanics. 

We now want to discuss these results in detail. The occupation number 
(4.5.8a) for photons diverges for p — 0 as 1/p (see Fig. 4.22), since the 
energy of the photons goes to zero when p — 0. Because the density of states 
in three dimensions is proportional to w?, this divergence is irrelevant to the 
energy content of the radiation field. The spectral energy density is shown in 
Fig 4.22. 


Fig. 4.22. The photon number as 
a function of hw/kT (dot-dashed 
curve). The spectral energy den- 
sity as a function of hw/kT (solid 
curve). 


As a function of hw, it shows a maximum at 
Rwmax = 2.82 kT , (4.5.10) 


i.e. around three times the thermal energy. The maximum shifts proportion- 
ally to the temperature. Equation (4.5.10), Wien’s displacement law (1893), 
played an important role in the historical development of the theory of the 
radiation field, leading to the discovery of Planck’s quantum of action. In 
Fig. 4.23, we show u(w, 7) for different temperatures T. 
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u(wW) 


Fig. 4.23. Planck’s law for three tem- 
peratures, T) < To < T3 


We now consider the limiting cases of Planck’s radiation law: 


(i) hw < kT: for low frequencies, we find using (4.5.9) that 


kTw? 


ulw) = a 


(4.5.11) 
the Rayleigh—Jeans radiation law. This is the classical low-energy limit. This 
result of classical physics represented one of the principal problems in the 
theory of the radiation field. Aside from the fact that it agreed with experi- 
ment only for very low frequencies, it was also fundamentally unacceptable: 
for according to (4.5.11), in the high-frequency limit w — oo, it leads to a 
divergence in u(w), the so called ultraviolet catastrophe. This would in turn 
imply an infinite energy content of the cavity radiation, i See dw u(w) = oo. 
(ii) fiw >> kT: In the high-frequency limit, we find from (4.5.9) that 


fig? <a 
uw) = Gao (4.5.12) 
The energy density decreases exponentially with increasing frequency. This 
empirically derived relation is known as Wien’s law. In his first derivation, 
Planck farsightedly obtained (4.5.9) by interpolating the corresponding en- 
tropies between equations (4.5.11) and (4.5.12). 


Often, the energy density is expressed in terms of the wavelength X: starting 


from w = ck = 2c we obtain dw = — aee dX. Therefore, the energy per unit volume 


in the interval [A, A + dA] is given by 


2 
dE, = (« _ =) Fa a a aor: Be dX (4.5.13) 
d dX 5 (c amine = 1) 


where we have inserted (4.5.9). The energy density as a function of the wavelength 
ue has its maximum at the value Amax, determined by 


2rhe 
kT X\max 


= 4.965 . (4.5.14) 
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We will now calculate the radiation which emerges from an opening in 
the cavity at the temperature T. To do this, we first note that the radiation 
within the cavity is completely isotropic. The emitted thermal radiation at a 
frequency w into a solid angle d? is therefore u(w)42. The radiation energy 
which emerges per unit time onto a unit surface is 

20 

1 


1 
Mw.T) =z | d2eu(w) cos = ie 
0 


ag [ dnneule) - 7 u(w) . (4.5.15) 
0 


The integration over the solid angle d( extends over only one hemisphere 
(see Fig. 4.24). The total radiated power per unit surface (the energy flux) is 
then 


Ip(T) = jf tet(w.7) mi hie (4.5.16) 


where again the Stefan—Boltzmann constant o from Eq. (4.5.5) enters the 
expression. 


Fig. 4.24. The radiation emis- 
sion per unit surface area from 
a cavity radiator (black body) 


A body which completely absorbs all the radiation falling upon it is called 
a black body. A small opening in the wall of a cavity whose walls are good 
absorbers is the ideal realization of a black body. The emission from such an 
opening calculated above is thus the radiation emitted by a black body. As 
an approximation, Eqns. (4.5.15,16) are also used to describe the radiation 
from celestial bodies. 


Remark: The Universe is pervaded by the so called cosmic background radiation 
discovered by Penzias and Wilson, which corresponds according to Planck’s law to 
a temperature of 2.73 K. It is a remainder from the earliest times of the Universe, 
around 300,000 years after the Big Bang, when the temperature of the cosmos had 
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already cooled to about 3000 K. Previous to this time, the radiation was in thermal 
equilibrium with the matter. At temperatures of 3000 K and below, the electrons 
bond to atomic nuclei to form atoms, so that the cosmos became transparent to 
this radiation and it was practically decoupled from the matter in the Universe. 
The expansion of the Universe by a factor of about one thousand then led to a 
corresponding increase of all wavelengths due to the red shift, and thus to a Planck 
distribution at an effective temperature of 2.73 K. 


*4.5.4 Supplemental Remarks 


Let us now interpret the properties of the photon gas in a physical sense and 
compare it with other gases. 
The mean photon number is given by 


[oe) 


! 1 4 dw w? 
se 2) yo / eliw/kT _ | 
P 0 


VV (RES f daw? — 2¢(3),, (T° 
 72e3H3 e—1 7 he ' 
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where the value p = 0 is excluded in ae Inserting ¢(3), we obtain 


3 
N =0.244V (=) (4.5.17) 
C 


Combining this with (4.5.6c) and (4.5.6a) and inserting approximate numer- 
ical values shows a formal similarity to the classical ideal gas: 


PV =0.9NkT (4.5.18) 
S=3.6Nk, (4.5.19) 
where N is however always given by (4.5.17) and does not have a fixed value. 
The pressure per particle is of about the same order of magnitude as in the 


classical ideal gas. 
The thermal wavelength of the photon gas is found to be 


Qn 2rhe _ se 
kimax 2.82kT T|K] 


Ar = cm] . (4.5.20) 
With the numerical factor 0.510, Ar is obtained in units of cm. Inserting into 
(4.5.17), we find 


In \? V V 
N = 0.244 | —— —= =2.70—~. 4.5.21 
3) wu % Cen 


Vv 
> NAS 
photons (V/N)!/3 is, from (4.5.21), of the order of magnitude of \r, and 
therefore, they must be treated quantum mechanically. 


For the classical ideal gas > 1; in contrast, the average spacing of the 
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At room temperature, i.e. T = 300K, Av = 1.7 x 107-3 cm and the density 
is X =5.5x 10% cm~*. At the temperature of the interior of the Sun, i.e. T ~ 
10°K, Ay = 5.1 x 107-8 cm and the density is x = 2.0 x 10?2cm7?. In 
comparison, the wavelength of visible light is in the range \ = 10~+cm. 
Note: If the photon had a finite rest mass m, then we would have g = 3. In 
that case, a factor of 3 would enter the Stefan—Boltzmann law. The exper- 
imentally demonstrated validity of the Stefan—Boltzmann law implies that 


either m = 0, or that the longitudinal photons do not couple to matter. 


The chemical potential: The chemical potential of the photon gas can be com- 
puted from the Gibbs—Duhem relation E = TS — PV + wN, since we are dealing 
with a homogeneous system: 


1 1 16 4\ oVT® 
= —(E-TS+PV)= 4 =0. 4.5.22 
Bay Say x( +5) Be (ee?) 


The chemical potential of the photon gas is identical to 0 for all temperatures, 
because the number of photons is not fixed, but rather adjusts itself to the tem- 
perature and the volume. Photons are absorbed and emitted by the surrounding 
matter, the walls of the cavity. In general, the chemical potential of particles and 
quasiparticles such as phonons, whose particle numbers are not subject to a con- 
servation law, is zero. For example we consider the free energy of a fictitious con- 
stant number of photons (phonons etc.), F(T, V, Npn). since the number of photons 
(phonons) is not fixed, it will adjust itself in such a way that the free energy is 
potential, which therefore vanishes: = 0. We could have just as well started from 


#= 0. 


minimized, ) = 0. This is however just the expression for the chemical 
T,V 


the maximization of the entropy, ( ae -) = 
E,V 


*4.5.5 Fluctuations in the Particle Number of Fermions and Bosons 


Now that we have become acquainted with the statistical properties of various 
quantum gases, that is of fermions and bosons (including photons, whose 
particle-number distribution is characterized by » = 0), we now want to 
investigate the fluctuations of their particle numbers. For this purpose, we 
begin with the grand potential 


& = —B" log S~ e FU preer—H) | (4.5.23) 
{np} 


Taking the derivative of ® with respect to €, yields the mean value of nq: 


> n eB Up Mr (Er—H) 
OP {np} : 
0€q an be e 8 Up Np(Ep—L) 
{np} 


= (nq) - (4.5.24) 
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The second derivative of & yields the mean square deviation 


Oh 


sor = —B{(n2) — (rq)? } = —B( Ang)? (4.5.25) 
qd 
Thus, using — = Te on we obtain 
(€q-H) 
(Ang) = go 2kna) da EE he (mqg)(1+(mq)). (4.5.26) 


0€q (ef (ea—H) ae, 1)? 


For fermions, the mean square deviation is always small. In the range of 
occupied states, where (nz) = 1, An, is zero; and in the region of small (nq), 


Ang © (1q)!?. 


Remark: For bosons, the fluctuations can become very large. In the case of large 
occupation numbers, we have Ang ~ (n(q)) and the relative deviation approaches 
one. This is a consequence of the tendency of bosons to cluster in the same state. 
These strong fluctuations are also found in a spatial sense. If N bosons are enclosed 
in a volume of L?, then the mean number of bosons in a subvolume a® is given by 
fi = Na®/L*. In the case that a < 4, where ) is the extent of the wavefunctions 
of the bosons, one finds the mean square deviation of the particle number (AN,3)* 
within the subvolume to be”? 


(AN,3)? =n(n+1). 


For comparison, we recall the quite different behavior of classical particles, which 
obey a Poisson distribution (see Sect. 1.5.1). The probability of finding n particles 
in the subvolume a? for a/L < 1 and N > oo is then 


_~”zAnN 
P, =e" — 
n! 


with 7 = Na®/L*, from which it follows that 


The deviations of the counting rates of bosons from the Poisson law have been 
experimentally verified using intense photon beams.?? 


4.6 Phonons in Solids 


4.6.1 The Harmonic Hamiltonian 


We recall the mechanics of a linear chain consisting of N particles of mass m 
which are coupled to their nearest neighbors by springs of force constant f. 
In the harmonic approximation, its Hamilton function takes on the form 


22 A detailed discussion of the tendency of bosons to cluster in regions where their 
wavefunctions overlap may be found in E. M. Henley and W. Thirring, Elemen- 
tary Quantum Field Theory, McGraw Hill, New York 1962, p. 52ff. 

23 -R. Hanbury Brown and R.Q. Twiss, Nature 177, 27 (1956). 
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H=Wo4 De E ue + ! (Un —Un—1)"| . (4.6.1) 
One obtains expression (4.6.1) by starting from the Hamilton function of 
N particles whose positions are denoted by z,. Their equilibrium positions 
are x°, where for an infinite chain or a finite chain with periodic boundary 
conditions, the equilibrium positions have exact translational invariance and 
the distance between neighboring equilibrium positions is given by the lattice 
constant a = x?) ,, — x). One then introduces the displacements from the 
equilibrium positions, wu, = %, — 2°, and expands in terms of the un. The 
quantity Wo is given by the value of the overall potential energy W({z,}) of 
the chain in the equilibrium positions. Applying the canonical transformation 


i ik . m —ik 
i, = = ) "On, Mtn = 4/— ) ere? Pe, 4.6.2 
Vn ~ Qk N - k ( ) 


we can transform H into a sum of uncoupled harmonic oscillators 
1 
H=W+)- 5 (PaP-1 + wRQKQ—E) (4.6.1’) 
k 


where the frequencies are related to the wavenumber via 


k 
we = 24] f sin s (4.6.3) 


The Q, are called normal coordinates and the Py normal momenta. The Q; 
and Py are conjugate variables, which we will take to be quantum-mechanical 
operators in what follows. In the quantum representation, commutation rules 
hold: 


[ete Mtn’ =ihdnn’ , [Uns Un’) = [Mtin, Mtn’ =0 
which in turn imply that 
Qk, Pe] =ihdew , [Qe, Qe] = [Pr, Po] = 0; 


furthermore, we have Qi =Q_, and Pi = P_, . Finally, by introducing 
the creation and annihilation operators 


Qr= Baog (ok oF a‘ ,) » =i = (04 = a.) ) (4.6.4) 
we obtain 
7 1 
H=Wo+ > hwr (1 + 5) (4.6.1”) 


k 
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with the occupation (number) operator 

Ap =ala, (4.6.5) 
and [ay ay] = Okk’ 5 [ak, ax’ = [al al] =0. 
In this form, we can readily generalize the Hamiltonian to three dimensions. 
In a three-dimensional crystal with one atom per unit cell, there are three 
lattice vibrations for each wavenumber, one longitudinal (/) and two trans- 
verse (t1,t2) (see Fig. 4.25). If the unit cell contains s atoms, there are 3s 
lattice vibrational modes. These are composed of the three acoustic modes, 


whose frequencies vanish at k = 0, and the 3(s — 1) optical phonon modes, 
whose frequencies are finite at k = 0.74 


Fig. 4.25. The phonon 
frequencies in a crystal 


. with one atom per unit 
-1/a 0 n/a ki=k cell 


We shall limit ourselves to the simple case of a single atom per unit cell, 
i.e. to Bravais-lattice crystals. Then, according to our above considerations, 
the Hamiltonian is given by: 


z 1 
H=Wo(V) + S- hw, (7%. + 5) (4.6.6) 
kd 


Here, we have characterized the lattice vibrations in terms of their wavevector 
k and their polarization ». The associated frequency is w,,, and the oper- 
ator for the occupation number is fix,,. The potential energy Wo(V) in the 
equilibrium lattice locations of the crystal depends on its lattice constant, or, 
equivalently when the number of particles is fixed, on the volume. For brevity, 
we combine the wavevector and the polarization into the form k = (k, A). Ina 
lattice with a total of N atoms, there are 3N vibrational degrees of freedom. 


4 See e.g. J.M. Ziman, Principles of the Theory of Solids, 2"¢ edition, Cambridge 
University Press, 1972. 
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4.6.2 Thermodynamic Properties 


In analogy to the calculation for photons, we find for the free energy 
F =—kTlogZ =W(V pe EE + kT log (1- eine )| . (4.6.7) 


The internal energy is found from 


B=-? 8 ; (4.6.8) 
OT Ts 
thus 
neh 1 4 ! 
k 


It is again expedient for the case of phonons to introduce the normalized 
density of states 


1 
gw) = 3N » d(w — wr) , (4.6.9) 
where the prefactor has been chosen so that 
fw gw)=1. (4.6.10) 


0 


Using the density of states, the internal energy can be written in the form: 


hw 


where we have used Ep = °, hw, /2 to denote the zero-point energy of the 
phonons. For the thermodynamic quantities, the precise dependence of the 
phonon frequencies on wavenumber is not important, but instead only their 
distribution, i.e. the density of states. 

Now, in order to determine the thermodynamic quantities such as the 
internal energy, we first have to calculate the density of states, g(w). For 
small k, the frequency of the longitudinal phonons is wy, = qk, and that of 
the transverse phonons is wx, = ck, the latter doubly degenerate; here, c; 
and c; are the longitudinal and transverse velocities of sound. Inserting these 
expressions into (4.6.9), we find 


Vw (1 2 
Dis fyi ban ee af ia he ee 
g(w) = ~ dq2 wz fake 325 (w — cyk) + 26(w — ek)| NG (= + =) i 


(4.6.12) 
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Equation (4.6.12) applies only to low frequencies, i.e. in the range where 
the phonon dispersion relation is in fact linear. In this frequency range, the 
density of states is proportional to w?, as was also the case for photons. 
Using (4.6.12), we can now compute the thermodynamic quantities for low 
temperatures, since in this temperature range. only low-frequency phonons 
are thermally excited. In the high-temperature limit, as we shall see, the 
detailed shape of the phonon spectrum is unimportant; instead, only the 
total number of vibrational modes is relevant. We can therefore treat this 
case immediately, also (Eq. 4.6.14). At low temperatures only low frequencies 
contribute, since frequencies w >> kT'/h are suppressed by the exponential 
function in the integral (4.6.11). Thus the low-frequency result (4.6.12) for 
g(w) can be used. Corresponding to the calculation for photons, we find 


Vre-kt (1 2 
E=W(V)+ £4 T?. 4.6.13 
o(V) O38 0h8 (= =) ( ) 


At high temperatures, i.e. temperatures which are much higher than hwmax/k, 


where Wax is the maximum frequency of the phonons, we find for all frequen- 


cies at which g(w) is nonvanishing that (e"/*? — in my AT 


it follows from (4.6.11) and (4.6.10) that 


and therefore, 


E=W(V)+ Eo +3NkT. (4.6.14) 


Taking the derivative with respect to temperature, we obtain from (4.6.13) 
and (4.6.14) in the low-temperature limit 


Cy ~T?; (4.6.15) 
this is Debye’s law. In the high-temperature limit, we have 
Cy & 3Nk, (4.6.16) 


the law of Dulong—Petit. At low temperatures, the specific heat is proportional 
to T°, while at high temperatures, it is equal to the number of degrees of 
freedom times the Boltzmann constant. 

In order to determine the specific heat over the whole range of tempera- 
tures, we require the normalized density of states g(w) for the whole frequency 
range. The typical shape of g(w) for a Bravais crystal?4 is shown in Fig. 4.26. 
At small values of w, the w? behavior is clearly visible. Above the maximum 
frequency, g(w) becomes zero. In intermediate regions, the density of states 
exhibits characteristic structures, so called van Hove singularities?4 which 
result from the maxima, minima, and saddle points of the phonon dispersion 
relation; their typical form is shown in Fig. 4.27. 

An interpolation formula which is adequate for many purposes can be 
obtained by approximating the density of states using the Debye approxima- 
tion: 
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Wp W 
Fig. 4.26. The phonon density of Fig. 4.27. A phonon dispersion rela- 
states g(w). Solid curve: a realistic tion with maxima, minima, and sad- 
density of states; dashed curve: the dle points, which express themselves 
Debye approximation in the density of states as van Hove 
singularities 
3w? 
gp(w) = —z O(wp — w), (4.6.17a) 
“Dp 
with 
Aan eo cae a (4.6.17b) 
oe. That ee” 32 


With the aid of (4.6.17a), the low-frequency expression (4.6.12) is extended 
to cover the whole range of frequencies and is cut off at the so called Debye 
frequency wp, which is chosen in such a way that (4.6.10) is obeyed. The 
Debye approximation is also shown in Fig. 4.26. 

Inserting (4.6.17a) into (4.6.11), we obtain 


E=W)(V)+E)+3NkTD (=) (4.6.18) 
with 
pia = & f dy y" (4.6.19) 
cv) = 53 ev—] 5 .O. 


0 


Taking the temperature derivative of (4.6.18), we obtain an expression for 
the specific heat, which interpolates between the two limiting cases of the 
Debye and the Dulong-Petit values (see Fig. 4.28). 


*4.6.3 Anharmonic Effects 
and the Mie—Griineisen Equation of State 


So far, we have treated only the harmonic approximation. In fact, the Hamil- 
tonian for phonons in a crystal also contains anharmonic terms, e.g. 
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Fig. 4.28. The heat capacity of a 
monatomic insulator. At low temper- 
atures, Cy ~ T°; at high tempera- 
tures, it is constant 


Aint = S- (ki, k2) Qk: Qko Q—ki ko 


ki,ke 


with coefficients c(k1,k2). Terms of this type and higher powers arise from 
the expansion of the interaction potential in terms of the displacements of the 
lattice components. These nonlinear terms are responsible for (i) the ther- 
mal expansion of crystals, (ii) the occurrence of a linear term in the specific 
heat at high T, (iii) phonon damping, and (iv) a finite thermal conductivity. 
These terms are also decisive for structural phase transitions. A systematic 
treatment of these phenomena requires perturbation-theory methods. The an- 
harmonic terms have the effect that the frequencies w; depend on the lattice 
constants, i.e. on the volume V of the crystal. This effect of the anharmonicity 
can be taken into account approximately by introducing a minor extension 
to the harmonic theory of the preceding subsection for the derivation of the 
equation of state. 

We take the volume derivative of the free energy F’. In addition to the 
potential energy Wo of the equilibrium configuration, also wz, owing to the 
anharmonicities, depends on the volume V; therefore, we find for the pressure 


OF OW 1 1 Olog wp 
P= = hw . (4.6.2 
ar). aV » ‘(G+aar) oye) 


For simplicity, we assume that the logarithmic derivative of wz, with respect 
to the volume is the same for all wavenumbers (the Griineisen assumption): 


O log wr 1 Olog wz 1 
= = : 4.6.21 
OV. VdlogpV 'V ( ) 
The material constant y which occurs here is called the Griineisen constant. 
The negative sign indicates that the frequencies become smaller on expansion 
of the lattice. We now insert (4.6.21) into (4.6.20) and compare with (4.6.8’), 
thus obtaining, with Ep, = E — Wo, the Mie—Grtineisen equation of state: 


P=—-— 244-2. (4.6.22) 


This formula applies to insulating crystals in which there are no electronic 
excitations and the thermal behavior is determined solely by the phonons. 
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From the Mie—Grtineisen equation of state, the various thermodynamic 
derivatives can be obtained, such as the thermal pressure coefficient (3.2.5) 


OP 
=|— = Cv (T)/V 4.6.23 
8 (sz), 4Cv(T)/ (4.6.23) 
and the linear expansion coefficient (cf. Appendix I., Table I.3) 
1 (OV 
= >> (=> 4.6.24 
bi wv (SE). (4.6.24) 
dP) __ (av av) — (ar), ee 
which, owing to (Sry a (Sh) p/ ($5) p = ~Z7> can also be given in 
the form 
1 
Ql = ghhr . (4.6.25) 


In this last relation, at low temperatures the compressibility can be replaced 


by 
1 (AV aW\* 
kr(0) =z ee = (v ATE ) (4.6.26) 


At low temperatures, from Eqns. (4.6.23) and (4.6.25), the coefficient of ther- 
mal expansion and the thermal pressure coefficient of an insulator, as well as 
the specific heat, are proportional to the third power of the temperature: 


axB«T?. 


As aresult of the thermodynamic relationship of the specific heats (3.2.24), we 
find Cp—Cy « T”. Therefore, at temperatures below the Debye temperature, 
the isobaric and the isochoral specific heats are practically equal. 

In analogy to the phonons, one can determine the thermodynamic proper- 
ties of other quasiparticles. Magnons in antiferromagnetic materials likewise 
have a linear dispersion relation at small values of & and therefore, their 
contribution to the specific heat is also proportional to T?. Magnons in ferro- 
ee have a quadratic dispersion relation ~ k?, leading to a specific heat 
~ T3 2. 


4.7 Phonons und Rotons in He II 


4.7.1 The Excitations (Quasiparticles) of He II 


At the conclusion of our treatment of the Bose-Einstein condensation in 4.4, 
we discussed the phase diagram of “He. In the He II phase below T, = 2.18 K, 
4He undergoes a condensation. States with the wavenumber 0 are occupied 


214 4. Ideal Quantum Gases 


macroscopically. In the language of second quantization, this means that the 
expectation value of the field operator y(x) is finite. The order parameter 
here is (w(x)).2° The excitation spectrum is then quite different from that 
of a system of free bosons. We shall not enter into the quantum-mechanical 
theory here, but instead use the experimental results as starting point. At 
low temperatures, only the lowest excitations are relevant. In Fig. 4.29, we 
show the excitations as determined by neutron scattering. 


energy EpK 


Fig. 4.29. The quasiparticle excita- 
tions in superfluid *He: phonons and 


0 1 2 3 4 
wavenumber (p/h) A" rotons after Henshaw and Woods.”° 


The excitation spectrum exhibits the following characteristics: for small val- 
ues of p, the excitation energy depends linearly on the momentum 


Ep =cp. (4.7.1a) 


In this region, the excitations are called phonons, whose velocity of sound 
is c = 238m/sec. A second characteristic of the excitation spectrum is a 
minimum at po = 1.91 A~'hA. In this range, the excitations are called rotons, 
and they can be represented by 


_ 2 
_ a, (pl=2) 


ep = Tate (4.7.1b) 


25 We have 
ao |go(N)) = VN |do(N — 1)) © VN |do(N)) 
ai, |o(N)) = VN +1|¢o(N +1)) ¥ VN |do(N)) , 


since due to the macroscopic occupation of the ground state, N > 1. See for 
example QM II, Sect. 3.2.2. 
26 1D. G. Henshaw and A.D. Woods, Phys. Rev. 121, 1266 (1961) 
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with an effective mass = 0.16 mye and an energy gap A/k = 8.6K. These 
properties of the dispersion relations will make themselves apparent in the 
thermodynamic properties. 


4.7.2 Thermal Properties 


At low temperatures, the number of excitations is small, and their inter- 
actions can be neglected. Since the “He atoms are bosons, the quasiparti- 
cles in this system are also bosons.?” We emphasize that the quasiparticles 
in Eqns. (4.7.1a) and (4.7.1b) are collective density excitations, which have 
nothing to do with the motions of individual helium atoms. 

As a result of the Bose character and due to the fact that the number of 
quasiparticles is not conserved, i.e. the chemical potential is zero, we find for 
the mean occupation number 


n(ep) = (8? —1) (4.7.2) 
From this, the free energy follows: 


kT 
F(T,V) = son | be log (1—e7~?) , (4.7.3a) 


and for the average number of quasiparticles 


V 
Nap(L,V) = Gane | enter) (4.7.3b) 
and the internal energy 
i 3 
E(T,V) = (ahs d°’pépn(Ep) - (4.7.3c) 


At low temperatures, only the phonons and rotons contribute in (4.7.3a) 
through (4.7.3c), since only they are thermally excited. The contribution of 
the phonons in this limit is given by 


nV (kT)4 mV (kT)4 
Ey = —-——— Ey, = = 4.7.4a,b 
Pe 90(ficys PA 30 (Ficy8 a) 
From this, we find for the heat capacity at constant volume: 
Qn?2VkAT3 
Cm 4.7.4 
v= “T5 (fics ae 


27 Tn contrast, in interacting fermion systems there can be both Fermi and Bose 
quasiparticles. The particle number of bosonic quasiparticles is in general not 
fixed. Additional quasiparticles can be created; since the changes in the angular 
momentum of every quantum-mechanical system must be integral, these excita- 
tions must have integral spins. 
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Due to the gap in the roton energy (4.7.1b), the roton occupation number at 
low temperatures T < 2K can be approximated by n(ép) © e~%*P, and we 
find for the average number of rotons 


aes ee ee ae ee er 
NX oa | Pre . = sg | tere € 
0 


co 
e-BA | dp p? e~8(—Po)? /2H 


nhs 
An 1/2 _BA 
~ sae ~BA,2 sf dp o~B(P~ Po)? 2H — YP O- (on mpkT)Y? @-B4 
(4.7.5a) 
The contribution of the rotons to the internal energy is 
V 6) kT 
E, = —— | & Pep — N= | A Ny 4.7.5b 
ame | Pre p= (445) a. ary 
from which we obtain the specific heat 
ae AN? 
= — c 4.7. 
Cy = at +a (a) )™ (4.7.5c) 


where from (4.7.5a), N,; goes exponentially to zero for T — 0. In Fig. 4.30, 
the specific heat is drawn in a log-log plot as a function of the temperature. 
The straight line follows the T? law from Eq. (4.7.4c). Above 0.6 K, the roton 
contribution (4.7.5c) becomes apparent. 
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Fig. 4.30. The specific heat of 
helium II under the saturated 
vapor pressure (Wiebes, Niels- 
temperature (K) Hakkenberg and Kramers). 
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*4.7.3 Superfluidity and the Two-Fluid Model 


The condensation of helium and the resulting quasiparticle dispersion rela- 
tion (Eq. 4.7.1la,b, Fig. 4.29) have important consequences for the dynamic 
behavior of 4He in its He II phase. Superfluidity and its description in terms 
of the two-fluid model are among them. To see this, we consider the flow 
of helium through a tube in two different inertial frames. In frame K, the 
tube is at rest and the liquid is flowing at the velocity —v. In frame Ko, we 
suppose the helium to be at rest, while the tube moves with the velocity v 
(see Fig. 4.31). 


frame K 
(laboratory) 
Fig. 4.31. Superflu- 
id helium in the rest 
frame of the tube, kK, 
and in the rest frame 
of the liquid, Ko 


frame Ko 
(He rest frame) 


The total energies (EF, Eo) and the total momenta (P, Po) of the liquid in 
the two frames (K,Ko) are related by a Galilei transformation. 


P = Py — Mv (4.7.6a) 


M 2 
E=E)—Po-v+ —- (4.7.6b) 


Here, we have used the notation 
Spee P S "Pio = Po, Som =M. (4.7.6c) 


One can derive (4.7.6a,b) by applying the Galilei transformation for the individual 
particles 


Xi=Xio—-—vl , Pi =Pi0o—mMv. 
This gives for the total momentum 
P= So pi = S = (pio —mv)=Po- Mv 


and for the total energy 
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In an ordinary liquid, any flow which might initially be present is damped by 
friction. Seen from the frame Ko, this means that in the liquid, excitations 
are created which move along with the walls of the tube, so that in the course 
of time more and more of the liquid is pulled along with the moving tube. 
Seen from the tube frame K, this process implies that the flowing liquid is 
slowed down. The energy of the liquid must simultaneously decrease in order 
for such excitations to occur at all. We now need to investigate whether for 
the particular excitation spectrum of He II, Fig. 4.29, the flowing liquid can 
reduce its energy by the creation of excitations. 

Is it energetically favorable to excite quasiparticles? We first consider 
helium at the temperature T = 0, i.e. in its ground state. In the ground 
state, energy and momentum in the frame Ko are given by 


Eg and Po=0. (4.7.7a) 


It follows for these quantities in the frame K: 


Mv? 
Bo = Bf 4+ 


and P=—Mv. (4.7.7b) 


If a quasiparticle with momentum p and energy €p is created, the energy and 
the momentum in the frame Ko have the values 


Eo => Eg + Ep and Po =p, (4.7.7c) 


and from (4.7.6a,b) we find for the energy in the frame K: 


E=E§+ep—-p-v4 and P=p-—Mv. (4.7.7d) 


The excitation energy in K (the tube frame) is thus 
AE =ep-p-v. (4.7.8) 


AE is the energy change of the liquid due to the appearance of an excitation 
in frame K. Only when AF < 0 does the flowing liquid reduce its energy. 
Since € — pv is a minimum when p is parallel to v, the inequality 


E 
v>= 4.7.9a 
p ( ) 


must be obeyed for an excitation to occur. From (4.7.9a) we find the critical 
velocity (Fig. 4.32) 


Ue = (£) = 60 m/sec . (4.7.9b) 


If the flow velocity is smaller than v,, no quasiparticles will be excited and 
the liquid flows unimpeded and loss-free through the tube. This phenomenon 
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Fig. 4.32. Quasiparticles and the 
critical velocity 


is called superfluidity. The occurrence of a finite critical velocity is closely 
connected to the shape of the excitation spectrum, which has a finite group 
velocity at p = 0 and is everywhere greater than zero (Fig. 4.32). 

The value (4.7.9b) of the critical velocity is observed for the motion of ions 
in He II. The critical velocity for flow in capillaries is much smaller than v¢, 
since vortices occur already at lower velocities; we have not considered these 
excitations here. 

A corresponding argument holds also for the formation of additional exci- 
tations at nonzero temperatures. At finite temperatures, thermal excitations 
of quasiparticles are present. What effect do they have? The quasiparticles 
will be in equilibrium with the moving tube and will have the average veloc- 
ity of the frame Ko, v. The condensate, i.e. the superfluid component, is at 
rest in Ko. The quasiparticles have momentum p and an excitation energy 
of ep in Kg. The mean number of these quasiparticles is n(ep — p- v). (One 
has to apply the equilibrium distribution functions in the frame in which the 
quasiparticle gas is at rest! — and there, the excitation energy is ep — p-v). 
The momentum of the quasiparticle gas in Ko is given by 


Po = oat / d’ppn(ép—p-v). (4.7.10) 


For low velocities, we can expand (4.7.10) in terms of v. Using [ d?pp n(ep) = 
0 and terminating the expansion at first order in v, we find 


-V On -V 1 On 
Py & 3 : = 5} yeas 
°* Qrh fe PP(P-Y) 5 = Gan” ; | 4 oe ie 


where { d° pp; p; f (|p|) = 36: Jd? pp? f (|p|) was used. At low T, it suffices 
to take the phonon contribution in this equation into account, i.e. 


GY, hf oe eB 

Peo eg 47.11 

Oph ahs * 38 ‘| wee Be oe 
0 


After integration by parts and replacement of 4m f dee?/c® by f d?p, one 
obtains 
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Po ph = Sane xa | ®peonlee) ; 
We write this result in the form 
Po ph = Vpn ph V ; (4.7.12) 
where we have defined the normal fluid density by 
4B Qn (kT)? 


Poph 372 45 Be: 


compare (4.7.4b). In (4.7.13), the phonon contribution to p, is evaluated. 
The contribution of the rotons is given by 


(4.7.13) 


| a eee (4.7.14) 


Eq. (4.7.14) follows from (4.7.10) using similar approximations as in the de- 
termination of N, in Eq. (4.7.5a). One calls pp = Pn,pn+Pn,r the mass density 
of the normal component. Only this portion of the density reaches equilibrium 
with the walls. 

Using (4.7.10) and (4.7.12), the total momentum per unit volume, Po/V, 
is found to be given by 


Po/V = pav. (4.7.15) 


We now carry out a Galilei transformation from the frame Ko, in which the 
condensate is at rest, to a frame in which the condensate is moving at the 
velocity v,. The quasiparticle gas, i.e. the normal component, has the velocity 
Vn = V+ vs in this reference frame. The momentum is found from (4.7.15) 
by adding pv, due to the Galilei transformation: 


P/V = pvs + Pnv. 
If we substitute v = vy, — v;, we can write the momentum in the form 
P/V = psVs + PnVn ; (4.7.16) 
where the superfluid density is defined by 
Ps =P—Pn- (4.7.17) 


Similarly, the free energy in the frame Ko can be calculated, and from it, 
by means of a Galilei transformation, the free energy per unit volume of the 
flowing liquid in the frame in which the superfluid component is moving at 
vs (problem 4.23): 


1 1 
F(T,V,Vs,Vn)/V = F(T, V)/V + 5 hes + 5 Pn n (4.7.18) 
where the free energy of the liquid at rest, F(T, V) is given by (4.7.3a) and 
the relations which follow it. 
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Fig. 4.33. The superfluid and the 
normal density ps; and pn in He II 
as functions of the temperature, mea- 
sured using the motion of a torsional 
oscillator by Andronikaschvili. 


The hydrodynamic behavior of the helium in the He II phase is as would be 
expected if the helium consisted of two fluids, a normal fluid with the density 
Pn, which reaches equilibrium with obstacles such as the inner wall of a tube in 
which it is flowing, and a superfluid with the density p,, which flows without 
resistance. When T' — 0, ps — p and p, — 0; for T — Ty ps — 0 and 
Pn — p. This theoretical picture, the two-—fluid model of Tisza and Landau, 
was experimentally confirmed by Andronikaschvili, among others (Fig. 4.33). 
It provides the theoretical basis for the fascinating macroscopic properties of 
superfluid helium. 


Problems for Chapter 4 


4.1 Demonstrate the validity of equations (4.3.24a) and (4.3.24b). 


4.2 Show that the entropy of an ideal Bose (Fermi) gas can be formulated as 
follows: 


= a (np) log (np) (1 + (np)) log (1 + (np))) : 


Consider this expression in the classical limit, also, as well as in the limit T — 0. 


4.3 Calculate Cy, Cp, «r, and a for ideal Bose and Fermi gases in the limit of 
extreme dilution up to the order h?. 


4.4 Estimate the Fermi energies (in eV) and the Fermi temperatures (in K) for the 


2 2/3 2\ 2/3 
following systems (in the free—particle approximation: € = d ): 
wing sy i p pprox: ered Or ai : 


(a) Electrons in metal 
(b) Neutrons in a heavy nucleus 
(c) *He in liquid 7He (V/N = 46.2 A®). 
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4.5 Consider a one-dimensional electron gas (S = 1/2), consisting of N particles 
confined to the interval (0, L). 

(a) What are the values of the Fermi momentum pr and the Fermi energy er? 
(b) Calculate, in analogy to Sect. 4.3, u = u(T, N/L). 


: 2 
Result: pr = ahN ; p=er[1+ - () + O(T*)]. 


oF 
Give a qualitative explanation of the different sign of the temperature dependence 
when compared to the three-dimensional case. 


4.6 Calculate the chemical potential (7, N/V) for a two-dimensional Fermi gas. 


4.7 Determine the mean square deviation (AN)? = (N?) — (N)? of the number 
of electrons for an electron gas in the limit of zero temperature. 


4.8 Calculate the isothermal compressibility (Eq. (4.3.18)) of the electron gas at 
low temperatures, starting from the formula (4.3.14’) for the pressure, P = 2 EN + 
a any x. Compare with the mean square deviation of the particle number found 


in problem 4.7. 


4.9 Compute the free energy of the nearly degenerate Fermi gas, as well as a and 
Cp. 


4.10 Calculate for a completely relativistic Fermi gas (€p = pc) 
(a) the grand potential & 

(b) the thermal equation of state 

(c) the specific heat Cy. 

Consider also the limiting case of very low temperatures. 


4.11 (a) Calculate the ground state energy of a relativistic electron gas, 


Ep = V/(mec?)? + (pc)?, in a white dwarf star, which contains N electrons and N/2 
helium nuclei (at rest), and give the zero-point pressure for the two limiting cases 


2 
MeC 
trK<1: P= a tp 
2 
MeC 1 
tr >1:P= ee (1 =)’ 
Vv Lp 


xp = “©. How does the pressure depend on the radius R of the star? 


Mec 


(b) Derive the relation between the mass M of the star and its radius R for the 
two cases rp <1 and xr > 1, and show that a white dwarf can have no greater 
mass than 


My — 9m, [3x (_he \*” 
: 64 V a® \ym2 : 


awl, 


G=6.7 x 10° *dyn cm?g? Gravitational constant 


Mp = 1.7 X 10 “4 Proton mass 


(c) If a star of a given mass M = 2m,N is compressed to a (finite) radius R, then 
its energy is reduced by the self-energy E, of gravitation, which for a homogeneous 
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mass distribution has the form Ey = —aGM?/R, where a is a number of the order 
of 1. From 

dEy | dRdEg _ 0 

dV dV dR 
you can determine the equilibrium radius, with dEo = —Po(R)41R?dR as the 


differential of the ground-state energy. 


4.12 Show that in a two-dimensional ideal Bose gas, there can be no Bose-Einstein 
condensation. 


4.13 Prove the formulas (4.4.10) and (4.4.11) for the entropy and the specific heat 
of an ideal Bose gas. 


4.14 Compute the internal energy of the ideal Bose gas for T < T.(v). From the 
result, determine the specific heat (heat capacity) and compare it with Eq. (4.4.11). 


4.15 Show for bosons with ¢, = ap*® and uw = 0 that the specific heat at low 


temperatures varies as T?/* in three dimensions. In the special case of s = 2, this 
yields the specific heat of a ferromagnet where these bosons are spin waves. 


4.16 Show that the maximum in Planck’s formula for the energy distribution u(w) 
is at Wmax = 2.82 57; see (4.5.10). 


4.17 Confirm that the energy flux Ig(T) which is emitted by a black body of tem- 


— energy emitted __ 


perature T into one hemisphere is given by (Eq. (4.5.16)), I(T) = SSS 28 


aE = oT", starting from the energy current density 


. 1 p 
je = >> c@ep(np,) - 
Vv rae p 


The energy flux Jz per unit area through a surface element of df is j Eva: 


4.18 The energy flux which reaches the Earth from the Sun is equal to 6 = 
0,136 Joule sec” cm~? (without absorption losses, for perpendicular incidence). 
b is called the solar constant. 

(a) Show that the total emission from the Sun is equal to 4 x 10° Joule sec™!. 
(b) Calculate the surface temperature of the Sun under the assumption that it 
radiates as a black body (T' ~ 6000 K). 

Rs =7x 10cm, Roz = 1AU=1.5 x 10'8cm 


4.19 Phonons in a solid: calculate the contribution of the so called optical phonons 
to the specific heat of a solid, taking the dispersion relation of the vibrations to be 
e(k) = we (Einstein model). 


4.20 Calculate the frequency distribution corresponding to Equation (4.6.17a) for 
a one- or a two-dimensional lattice. How does the specific heat behave at low 
temperatures in these cases? (examples of low-dimensional systems are selenium 
(one-dimensional chains) and graphite (layered structure)). 
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4.21 The pressure of a solid is given by P = — ovo + + Hee (see (4.6.22)). Show, 
under the assumption that Wo(V) = (V—Vo)?/2x0Vo for V ~ Vo and yoCvT «K Vo, 
that the thermal expansion (at constant P ~ 0) can be expressed as 


_ 1 (OV\ _ yx0Cv _ Px0C¢T 
a= V ($F) Vo and Cp —Cy Vo ; 


4.22 Specific heat of metals: compare the contributions of phonons and electrons. 
Show that the linear contribution to the specific heat becomes predominant only 


at T < T* = 0.140p\/0p/Tr. Estimate T* for typical values of 6p and Tr. 


4.23 Superfluid helium: show that in a coordinate frame in which the superfluid 
component is at rest, the free energy F = E — TS is given by 


at 
Py + Pnv™ , where @, = = log [2 — eae || P 
“pe 
p 
Expand @, and show also that in the system in which the superfluid component is 
moving at a velocity vs, 
2 2 
nUn sUs 
F =p + Pen 4 Pee | 


Hint: In determining the free energy F’, note that the distribution function 
n for the quasiparticles with energy €p is equal to n(ép — p- v). 


4.24 Ideal Bose and Fermi gases in the canonical ensemble: 

(a) Calculate the canonical partition function for ideal Bose and Fermi gases. 
(b) Calculate the average occupation number in the canonical ensemble. 
Suggestion: instead of Zy, compute the quantity 


Za) Sy ean 
N=0 


and determine Zn using Zy = s+ § rata) dx, where the path in the complex x 


plane encircles the origin, but includes no singularities of Z(x). Use the saddle— 
point method for evaluating the integral. 


4.25 Calculate the chemical potential z for the atomic limit of the Hubbard model, 


N 
A = US) nitniy 5 


i=1 


where nj = Clete is the number operator for electrons in the state 7 (at lattice 


site 7) anda = +3. (In the general case, which is not under consideration here, the 
Hubbard model is given by: 


H= atlas =F US- nina :!) 


ijo i 


5. Real Gases, Liquids, and Solutions 


In this chapter, we consider real gases, that is we take the interactions of the 
atoms or molecules and their structures into account. In the first section, the 
extension from the classical ideal gas will involve only including the internal 
degrees of freedom. In the second section, we consider mixtures of such ideal 
gases. The following sections take the interactions of the molecules into ac- 
count, leading to the virial expansion and the van der Waals theory of the 
liquid and the gaseous phases. We will pay special attention to the transition 
between these two phases. In the final section, we investigate mixtures. This 
chapter also contains references to every-day physics. It touches on bordering 
areas with applications in physical chemistry, biology, and technology. 


5.1 The Ideal Molecular Gas 


5.1.1 The Hamiltonian and the Partition Function 


We consider a gas consisting of N molecules, enumerated by the index n. 
In addition to their translational degrees of freedom, which we take to be 
classical as before, we now must consider the internal degrees of freedom 
(rotation, vibration, electronic excitation). The mutual interactions of the 
molecules will be neglected. The overall Hamiltonian contains the transla- 
tional energy (kinetic energy of the molecular motion) and the Hamiltonian 
for the internal degrees of freedom H;,,, summed over all the molecules: 


N 2 
H= (+ Hn) (5.1.1) 


The eigenvalues of Hj, are the internal energy levels ¢\,,. The partition 
function is given by 


yy 2 
Z(L,V,N) = aaa | pi... Bp e7 Ue Pn /2mkT en eth 
The classical treatment of the translational degrees of freedom, represented 
by the partition integral over momenta, is justified when the specific volume 
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is much larger than the cube of the thermal wavelength A = 2rh//V2amkT 
(Chap. 4). Since the internal energy levels ¢;,, = ¢; are identical for all of the 
molecules, it follows that 


N 
Z(T,V,N) = wqleee(1) Zid® 7 - 5323 (5.1.2) 


where Z = )0., e—i/kT ig the partition function over the internal degrees of 
freedom and Z;,(1) is the translational partition integral for a single molecule. 
From (5.1.2), we find the free energy, using the Stirling approximation for 
large N: 
4 
F = —-kT log Z = —NkT E + log we + log a , (5.1.3) 


From (5.1.3), we obtain the equation of state 


P=- (3) las R (5.1.4) 


which is the same as that of a monatomic gas, since the internal degrees of 
freedom do not depend on V. For the entropy, we have 


OF 5 4 Olog Z; 
geen pace = Nk |< +log —~ + log Z,+T , (bl. 
S (sr). 5+ 8 Apa t los Zi + aT (5.1.5a) 
and from it, we obtain the internal energy, 
3 Olog Z; 
B=F+TS=NkT|=-4+T , 5.1.5b 
+TS k | 5 + Tar ( ) 


The caloric equation of state (5.1.5b) is altered by the internal degrees of 
freedom compared to that of a monatomic gas. Likewise, the internal degrees 
of freedom express themselves in the heat capacity at constant volume, 


OE 3 O _»Olog Z; 
= ( — = Nk i Me 
ey tar s+ ar OT 


Finally, we give also the chemical potential for later applications: 


OF V 
me ee ee ae an 
bb oe Tog (sh5%') (5.1.5c) 


it agrees with p = 7(F + PV), since we are dealing with a homogeneous 
system. 

To continue the evaluation, we need to investigate the contributions due 
to the internal degrees of freedom. The energy levels of the internal degrees 
of freedom are composed of three contributions: 


(5.1.6) 
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Ei Eel + Erotiae Evib « (5.1.7) 


Here, €¢) refers to the electronic energy including the Coulomb repulsion of the 
nuclei relative to the energy of widely separated atoms. €;ot is the rotational 
energy and ¢€yip is the vibrational energy of the molecules. 

We consider diatomic molecules containing two different atoms (e.g. HCl; 
for identical atoms, cf. Sect. 5.1.4). Then the rotational energy has the form! 


A7U(1 + 1) 
Lot! =. ——_.aa wl. 
Exot oT (5.1.8a) 


where / is the angular momentum quantum number and J = Mrea 2 the 
moment of inertia, depending on the reduced mass myeq and the distance 
between the atomic nuclei, Ry.? The vibrational energy ¢,i, takes the form! 


1 
Evib = hw ¢ + 5) ; (5.1.8b) 


where w is the frequency of the molecular vibration and n = 0,1,2,.... The 
electronic energy levels €g; can be compared to the dissociation energy Episs. 
Since we want to consider non-dissociated molecules, i.e. we require that 
kT < €piss, and on the other hand the excitation energies of the lowest 
electronic levels are of the same order of magnitude as épiss, it follows from 
the condition kT’ < episs that the electrons must be in their ground state, 
whose energy we denote by ¢°,. Then we have 


0 


Z; = exp (-#4) GoD. (5.1.9) 


We now consider in that order the rotational part Z,., and the vibrational 
part Zyip of the partition function. 


5.1.2 The Rotational Contribution 


Since the rotational energy é;ot (5.1.8a) does not depend on the quantum 
number m (the z component of the angular momentum), the sum over m 
just yields a factor (21+ 1), and only the sum over / remains, which runs over 
all the natural numbers 


Zror = (21+ 1) exp (-S*) (5.1.10) 
l=0 


' In general, the moment of inertia J and the vibration frequency w depend?on I. 
The latter dependence leads to a coupling of the rotational and the vibrational 
degrees of freedom. For the following evaluation we have assumed that these 
dependences are weak and can be neglected. 

? See e.g. QM I 
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Here, we have introduced the characteristic temperature 


h2 


Or = a 


(5.1.11) 


We next consider two limiting cases: 


T <O,: At low temperatures, only the smallest values of | contribute in 
(5.1.10) 


Zeot = 1+ 37/7 + 5039/7 +O (e-PO/T) (5.1.12) 


T > O,: At high temperatures, the sum must be carried out over all | 
values, leading to 


T1,10, »f/0\? 
Zr = 2a + 3+ ay -0(($) Ns (5.1.13) 


To prove (5.1.13), one uses the Euler-MacLaurin summation formula? 


1=0 


Lro= fare 5f(0) 3 OE FY (0) + Rest (5.1.14) 
0 


for the special case that f(oo) = f’(oo) =... = 0. The first Bernoulli numbers By 
are given by By = +, Bo = yo- The first term in (5.1.14) yields just the classical 
result 


faro = fa (21 +1) exp (8S) =2 f dre = 2 (5.1.15) 
0 0 0 


2 T r 


which one would also obtain by treating the rotational energy classically instead of 
quantum-mechanically.* The further terms are found via 


Oe yn 0 .(9\' 1(@:\° 


from which, using (5.1.14), we obtain the expansion (5.1.13). 
From (5.1.12) and (5.1.13), we find for the logarithm of the partition 
function after expanding: 


3 Whittaker, Watson, A Modern Course of Analysis, Cambridge at the Clarendon 
Press; V.I. Smirnow, A Course of Higher Mathematics, Pergamon Press, Oxford 
1964: Vol. III, Part 2, p. 290. 

4 See eg. A. Sommerfeld, Thermodynamics and Statistical Physics, Academic 
Press, NY 1950 


An I? B1724..2, QkT 
Zro = Fo a di d —y (ep +43) = 3 
' ay | wf sg iP 
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log Zrot = 
2T (on 1 0, 2 0; : 
loa) er an +0((F) ae 
(5.1.16a) 


From this result, the contribution of the rotational degrees of freedom to the 
internal energy can be calculated: 


4) 
Evot = NkT? — log Zyo 
; ap ot 


3NKO,(e-%/7 —3e°O/T 4.) T<O, 


= te, itis iene (5.1.16b) 
NET(1 : ( *) ! <4) iY 
6T  180\T ere 
The contribution to the heat capacity at constant volume is then 
O,\2 

3(=) eP/T(1- Ge O/T...) TKO, 

Cret = Nk (5.1.16c) 
1 /O,\? 
— {| — ae T con 
180 a a 


In Fig. 5.1, we show the rotational contribution to the specific heat. 


Cy" /Nk 
1.0 


0 0.5 10 5 TIO, Fig. 5.1. The rotational contribution 


to the specific heat 


At low temperatures, the rotational degrees of freedom are not thermally 
excited. Only at T ~ O,/2 do the rotational levels contribute. At high tem- 
peratures, i.e. in the classical region, the two rotational degrees of freedom 
make a contribution of 2kT/2 to the internal energy. Only with the aid of 
quantum mechanics did it become possible to understand why, in contradic- 
tion to the equipartition theorem of classical physics, the specific heat per 
molecule can differ from the number of degrees of freedom multiplied by k/2. 
The rotational contribution to the specific heat has a maximum of 1.1 at the 
temperature 0.81 0,/2. For HCl, 0,/2 is found to be 15.02 K. 
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5.1.3 The Vibrational Contribution 


We now come to the vibrational contribution, for which we introduce a char- 
acteristic temperature defined by 


hu = ko, . (5.1.17) 
We obtain the well-known partition function of a harmonic oscillator 
oo 7 —0, /2T 
ata SS gate ee © 
Zyib = dX ee 1 = e-Ov/T ’ (5.1.18) 
whose logarithm is given by log Zyin = os — log (1 a e O/T) | From it, we 
find for the internal energy: 
bye Nr Zyib = NO Se : (5.1.19a) 
vib — OT § Zvib = Vv 2 eOv/T — 1 ; oi. 
and for the vibrational contribution to the heat capacity at constant volume 
' Of eet e? 1 
vib Vv Vv 
= Nk ~ ———___=N : 1.1 
T? [e@e/T — 1)? kre [2sinh O, /2T]? ert) 


At low and high temperatures, from (5.1.19b) we obtain the limiting cases 


(Sy es, TKO, 


Cvib 
Vo (5.1.19c) 
Nk 
: (& ) oe PS 6 
alr oe axe 
The excited vibrational energy levels are noticeably populated only at tem- 


peratures above O,. The specific heat (5.1.19b) is shown in Fig. 5.2. 


C¥¥/NK 
1.0 


0) 5 1.0 1.5 2.0770 Fig. 5.2. The vibration- 
: al part of the specific heat 
(Eq. (5.1.19b)) 


The contribution of the electronic energy €% to the partition function, 
free energy, internal energy, entropy, and to the chemical potential is, from 
(5.1.9): 
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Zel oa e- eal kT ’ Fa Pam Nes ’ Ee coed Ney ’ Sel =0 ’ Hel = on : 
(5.1.20) 


These contributions play a role in chemical reactions, where the (outer) elec- 
tronic shells of the atoms undergo complete restructuring. 

In a diatomic molecular gas, there are three degrees of freedom due to 
translation, two degrees of freedom of rotation, and one vibrational degree 
of freedom, which counts double (E = - + ¥w?x?; kinetic and potential 
energy each contribute 4 kT). The classical specific heat is therefore 7k/2, as 
is observed experimentally at high temperatures. All together, this gives the 
temperature dependence of the specific heat as shown in Fig. 5.3. The curve is 
not continued down to a temperature of T = 0, since there the approximation 
of a classical ideal gas is certainly no longer valid. 


ee eee eer ree Dissociation 
Vibration 
a eee ret ee 
Rotation 

¥o+--- —q—* - - - - - J------------ 
Fig. 5.3. The specific 
heat of a molecular 
8, /2 6, T gas at constant volume 


(schematic) 


The rotational levels correspond to a wavelength of 4 = 0.1 — 1cm and 
lie in the far infrared and microwave regions, while the vibrational levels 
at wavelengths of \ = 2 x 107? — 3 x 107%cm are in the infrared. The 
corresponding energies are 107? — 1074 eV and 0.06 —0.04 eV, resp. (Fig. 5.4). 
One electron volt corresponds to about 11000 K (1K 4 0.86171 x 10~*eV). 
Some values of 0; and O, are collected in Table 5.1. 

In more complicated molecules, there are three rotational degrees of free- 
dom and more vibrational degrees of freedom (for n atoms, in general 3n — 6 
vibrational degrees of freedom, and for linear molecules, 3n — 5). In precise 
experiments, the coupling between the vibrational and rotational degrees of 
freedom and the anharmonicities in the vibrational degrees of freedom are 
also detected. 


i HD. Dy Hcl 6; 

10 (K] 85 64 43 15 2 Table 5.1. The values of 
‘ :/2 and Oy f l 

6, [K] 6100 5300 4300 4100 2200 Oe 7 and Oaon eexers 


molecules 
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E 
@,:0.1-1 cm microwaves 
2 10%. 10°“ev 
@,:2-3x10%cm_ infrared 
= 0.06 - 0.04 eV 
= n=1 
, \=3 
(=2 . 
O Tay Fig. 5.4. The structure of the 
ra. n=0 rotational and_ vibrational levels 
} (schematic) 


*5.1.4 The Influence of the Nuclear Spin 


We emphasize from the outset that here, we make the assumption that the 
electronic ground state has zero orbital and spin angular momenta. For nu- 
clei A and B, which have different nuclear spins S4 and Sz, one obtains 
an additional factor in the partition function, (254 + 1)(2S'p + 1),ie.Z - 
(254 + 1)(2S8 + 1)Z;. This leads to an additional term in the free en- 
ergy per molecule of —kT log(2S'4 + 1)(2Sp + 1), and to a contribution of 
klog(2S4 + 1)(2Sp + 1) to the entropy, ie. a change of the chemical con- 
stants by log(2S4 + 1)(2Sg +1) (see Eq. (3.9.29) and (5.2.5’)). As a result, 
the internal energy and the specific heats remain unchanged. 

For molecules such as H2, D2, O2 which contain identical atoms, one must 
observe the Pauli principle. We consider the case of Hz, where the spin of the 
individual nuclei is Sy = 1/2. 


Ortho hydrogen molecule: Nuclear spin triplet (Si. = 1); the spatial 
wavefunction of the nuclei is antisymmetric 


(1 = odd (u)) 
Para hydrogen molecule: Nuclear spin singlet (Sot = 0); the spatial 
wavefunction of the nuclei is symmetric (J = 
even (g)) 
i+ 1) 0, 
Gee = es a 
ie (t+ 1)exn/ a (5.1.21a) 
l odd(u) 
W(l+1) 0, 
Zy= 21+ 1 - — wl. 
a= >, (lt Jexp( a (5.1.21b) 


l even(g) 
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In complete equilibrium, we have 
Z=3L,+Z,. 


At T = 0, the equilibrium state is the ground state | = 0, i.e. a para state. 
In fact, owing to the slowness of the transition between the two spin states 
at T = 0, a mixture of ortho and para hydrogen will be obtained. At high 
temperatures, 2, © Z, © 4 at — = holds and the mixing ratio of ortho to 
para hydrogen is 3:1. If we start from this state and cool the sample, then, 
leaving ortho-para conversion out of consideration, Hz consists of a mixture 
of two types of molecules: 3N ortho and +N para hydrogen, and the partition 
function of this (metastable) non-equilibrium state is 


Zi Giy (Ag l (5.1.22) 
Then for the specific heat, we obtain 


3 1 
Cyt = ri vet mad (5.23) 
In Fig. 5.5, the rotational parts of the specific heat in the metastable state 
(3 ortho and + para), as well as for the case of complete equilibrium, are 
shown. The establishment of equilibrium can be accelerated by using cata- 
lysts. 


C*/Nk 


equilibrium 


4 
, metastable 


mixture Fig. 5.5. The rotational part of the 


specific heat of diatomic molecules 
6/2 T such as He: equilibrium (solid curve), 
metastable mixture (dashed) 


In deuterium molecules, Dz, the nuclear spin per atom is S$ = 1,° which 
can couple in the molecule to ortho deuterium with a total spin of 2 or 0 
and para deuterium with a total spin of 1. The degeneracy of these states 
is 6 and 3. The associated orbital angular momenta are even (g) and odd 
(u). The partition function, corresponding to Eq. (5.1.21a-b), is given by 
Z = 62,4 32Zy. 


> QM I, page 187 
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*5.2 Mixtures of Ideal Molecular Gases 


In this section, we investigate the thermodynamic properties of mixtures of 
molecular gases. The different types of particles (elements), of which there 
are supposed to be n, are enumerated by the index j7. Then N; refers to the 
particle number, A; = Gamer? is the thermal wavelength, c; = an the 
concentration, €% j the electronic ground state energy, Z; the overall tine 
function (see (5.1.2)), and Z;,; the partition function ie the internal degrees 
of freedom of the particles of type j. Here, in contrast to (5.1.9), the electronic 
part is separated out. The total number of particles is N = >> j N;. 
The overall partition function of this non-interacting system is 


=|[4. (5.2.1) 
j=l 
and from it we find the free energy 
= Be 1+ og | + ose , (5.2.2) 
: _ _ (OF 
From (5.2.2), we obtain the pressure, P = — ($0) 7 Nj} 
kT kTN 
BS NS ge (5.2.3) 


The equation of state (5.2.3) is identical to that of the monatomic ideal gas, 
since the pressure is due to the translational degrees of freedom. For the 
chemical potential 41; of the component j (Sect. 3.9.1), we find 


OF VZij 
bh; = (sx) = —kT log +62, ; (5.2.4) 

PON ON, Jang N; x a 
or, if we use the pressure from (5.2.3) instead of the volume, 


RTA 5 
Hy = —kT log GPX Be ag’ (5.2.4") 


We now assume that the rotational degrees of freedom are completely un- 
frozen, but not the vibrational degrees of freedom (0, < T < O,). Then 


inserting Zi; = Zrot,j = oan (see Eq. (5.1.13)) into (5.2.4’) yields 


3/2 


[ly = eng — Cer logkT — kT log tkTloge;P. (5.2.5) 


m; 
21/273/2 BB KO, ; 


We have taken the fact that the masses and the characteristic temperatures 
depend on the type of particle 7 into consideration here. The pressure enters 


*5.2 Mixtures of Ideal Molecular Gases 235 


the chemical potential of the component j in the combination c;P = Pj; 
(partial pressure). The chemical potential (5.2.5) is a special case of the 
general form 


j= e015 —cpjTlogkT —kT¢;+ kT loge;P . (5.2.5/) 


For diatomic molecules in the temperature range mentioned above, cp; = 
7k/2. The ¢; are called chemical constants; they enter into the law of mass 
action (see Chap. 3.9.3). For the entropy, we find 


OM; 
7 eB ; ( oT Mes 


2 


= oS (cp; logkT + cp; + k¢; — kloge;P) N; (5.2.6) 
j 


from which one can see that the coefficient cp; is the specific heat at constant 
pressure of the component j. 


Remarks to Sections 5.1 and 5.2: In the preceding sections, we have de- 
scribed the essential effects of the internal degrees of freedom of molecular gases. 
We now add some supplementary remarks about additional effects which depend 
upon the particular atomic structure. 


(i) We first consider monatomic gases. The only internal degrees of freedom are 
electronic. In the noble gases, the electronic ground state has L = S = 0 and is 
thus not degenerate. The excited levels lie about 20eV above the ground state, 
corresponding to a temperature of 200.000 K higher; in practice, they are therefore 
not thermally populated, and all the atoms remain in their ground state. One can 
also say that the electronic degrees of freedom are “frozen out”. The nuclear spin 
Sw leads to a degeneracy factor (2S~ +1). Relative to pointlike classical particles, 


the partition function contains an additional factor (25 + 1)e7°0/ kT’ which gives 
rise to a contribution to the free energy of ¢9 — kT log(2S'n + 1). This leads to an 
additional term of k log(2Sy~ +1) in the entropy, but not to a change in the specific 
heat. 

(ii) The excitation energies of other atoms are not as high as in the case of the noble 
gases, e.g. 2.1eV for Na, or 24.000 K, but still, the excited states are not thermally 
populated. When the electronic shell of the atom has a nonzero S, but still L = 0, 
this leads together with the nuclear spin to a degeneracy factor of (25 +1)(2Sy +1). 
The free energy then contains the additional term €o — kT log((2S'n + 1)(29 + 1)) 
with the consequences discussed above. Here, to be sure, one must consider the 
magnetic interaction between the nuclear and the electronic moments, which leads 
to the hyperfine interaction. This is e.g. in hydrogen of the order of 6 x 10~% eV, 
leading to the well-known 21 cm line. The corresponding characteristic temperature 
is 0.07 K. The hyperfine splitting can therefore be completely neglected in the gas 
phase. 

(iii) In the case that both the spin S and the orbital angular momentum L are 
nonzero, the ground state is (2S + 1)(2L + 1)-fold degenerate; this degeneracy is 
partially lifted by the spin-orbit coupling. The energy eigenvalues depend on the 
total angular momentum J, which takes on values between S + L and |S — L|. 
For example, monatomic halogens in their ground state have S = } and L = 1, 


2 
according to Hund’s first two rules. Because of the spin-orbit coupling, in the ground 
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state J = 3, and the levels with J = 4 have a higher energy. For e.g. chlorine, 
the doubly-degenerate ? P; /2 level lies d¢ = 0.11eV above the 4-fold degenerate 
? P32 ground state level. This corresponds to a temperature of oe = 1270K. The 
partition function now contains a factor Z.) = Ae 20/kT 4 De (e0t82)/kT due to 
the internal fine-structure degrees of freedom, which leads to an additional term in 
the free energy of —kT log Z. = €0 —kT log (4 +2 ek ‘i This yields the following 


electronic contribution to the specific heat: 


For T < 6¢/k, Za = 4, only the four lowest levels are populated, and cd =0. 
For T > 6e/k, Z.1 = 6, and all six levels are equally occupied, so that Ce = 

For temperatures between these extremes, C’*! passes through a maximum at about 
6¢/k. Both at low and at high temperatures, the fine structure levels express them- 
selves only in the degeneracy factors, but do not contribute to the specific heat. One 
should note however that monatomic Cl is present only at very high temperatures, 
and otherwise bonds to give Cle. 

(iv) In diatomic molecules, in many cases the lowest electronic state is not degener- 
ate and the excited electronic levels are far from €9. The internal partition function 
contains only the factor e *0/kT due to the electrons. There are, however, molecules 
which have a finite orbital angular momentum A or spin. This is the case in NO, 
for example. Since the orbital angular momentum has two possible orientations rel- 
ative to the molecular axis, a factor of 2 in the partition function results. A finite 
electronic spin leads to a factor (25+1). For S # 0 and A # 0, there are again 
fine-structure effects which can be of the right order of magnitude to influence the 
thermodynamic properties. The resulting expressions take the same form as those 
in Remark (iii). A special case is that of the oxygen molecule, Oz. Its ground state 
3 5) has zero orbital angular momentum and spin S' = 1; it is thus a triplet without 
fine structure. The first excited level 1A is doubly degenerate and lies relatively 
near at de = 0.97eV 411300K, so that it can be populated at high temperatures. 


—eé —ée 
These electronic configurations lead to a factor of err (3 +2 esr ) in the partition 
function, with the consequences discussed in Remark (iii). 


5.3 The Virial Expansion 


5.3.1 Derivation 


We now investigate a real gas, in which the particles interact with each other. 
In this case, the partition function can no longer be exactly calculated. For its 
evaluation, as a first step we will describe the virial expansion, an expansion 
in terms of the density. The grand partition function Zg can be decomposed 
into the contributions for 0,1,2, etc. particles 


Zq = Tre (F-#NVAT — 14.27, V,1) "PLZ (T, V,2) e2#/"F 4... , (5.3.1) 


where Zy = Z(T,V,N) represents the partition function for N particles. 
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From it, we obtain the grand potential, making use of the Taylor series ex- 
pansion of the logarithm 


1 
@ = —kT log Zg = —kT |Zie"/*P a (2 = 52?) etu/er zane ’ , (8.3.2) 


where the logarithm has been expanded in powers of the fugacity z = e#/*?. 
Taking the derivatives of (5.3.2) with respect to the chemical potential, we 
obtain the mean particle number 


: aD 1 
a ($= = Z,ehlT 4 2( Zs ie 522) emit He (5.3.3) 
TLV 


Eq. (5.3.3) can be solved iteratively for e“/*", with the result 


re 2 
git ae) (NY (5.3.4) 
Des NN eis 3 


Eq. (5.3.4) represents a series expansion of e“/*? in terms of the density, 
since Z, ~ V. Inserting (5.3.4) into @ has the effect that ® is given in terms 
of T,V, N instead of its natural variables T, V, 4, which is favorable for con- 
structing the equation of state: 


Be bagi 
o=-kT|N-(Z-=22)+...] 5.3.5 

2 2 1 Ze ( ) 
These are the first terms of the so called virial expansion. By application 
of the Gibbs—Duhem relation = —PV, one can go from it directly to the 
expansion of the equation of state in terms of the particle number density 
p=N/V 


p= kT pl +B(T)pt+C(T)p?+...] . (5.3.6) 


The coefficient of p” in square brackets is called the (n+1)th virial coefficient. 
The leading correction to the equation of state of an ideal gas is determined 
by the second virial coefficient 


1 
BS -(% = 52%) V/Z2 (5.3.7) 


This expression holds both in classical and in quantum mechanics. 


Note: in the classical limit the integrations over momentum can be carried out, 
and (5.3.1) is simplified as follows: 


oO eGuN 
Zc(T,V.n) = S0 spew Q(LV.N) . (5.3.8) 
N=0° ° 
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Here, Q(T,V,N) is the configurational part of the partition function 
Z(T,V,N) 


Q(T,V,N) ae OB Each =e [[G+ fs) = 
t<J 
i (5.3.9) 
BN | i | it 
ya (fie + fia +...) + (fiefia +...) +...) 
Vv 
with f;; = e~°’4 —1. In this expression, icy = 7 Li Loy ys Tefers to the sum over 


all pairs of particles. One can see from this that the virial expansion represents an 
expansion in terms of ré /v, where ro is the range of the potential. The classical 
expansion is valid for \ < ro < v'/?; see Eqs. (B.39a) and (B.39b) in Appendix 
B. Equation (5.3.9) can be used as the basis of a systematic graph-theoretical 
expansion (Ursell and Mayer 1939). 


5.3.2 The Classical Approximation for the Second Virial 
Coefficient 


In the case of a classical gas, one finds for the partition function for N particles 


ZN = — [pap ce Pay edi Bj / 2b (er nse) /RT) : 


(5.3.10a) 
after integrating over the 3N momenta, this becomes 
1 
Zn = NHI pen (kd Spe ON IEE: (5.3.10b) 


where v(x1,...,Xw) is the total potential of the N particles. The integrals 
over X; are restricted to the volume V. If no external potential is present, and 
the system is translationally invariant, so that the two-particle interaction 
depends only upon x; — Xa, we find from (5.3.10b) 


1 V 
Zi= 3 fon eo = iy (5.3.11a) 


and 


ae fy e@ MIKE. (5:3.11b) 


1 3 3 —v(xX1—X2)/kT 
Zo = 55 [day dnc (x1 —x2)/ = axe 


This gives for the second virial coefficient (5.3.7): 


B= -5 f ay fly) = -5 f ay (e-vonver = 1) (5.3.12) 


with f(y) = e-%)/'T _ 1, To proceed, we now require the two-particle 
potential u(y), also known as the pair potential. In Fig. 5.6, as an example, the 
Lennard-Jones potential is shown; it finds applications in theoretical models 
for the description of gases and liquids and it is defined in Eq. (5.3.16). 
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v(r) 


Lennard-Jones : 
potential Fig. 5.6. The Lennard—Jones potential as an ex- 


ample of a pair potential u(y), Eq. (5.3.16) 


5.3.2.1 A Qualitative Estimate of B(T) 


A typical characteristic of realistic potentials is the strong increase for over- 
lapping atomic shells and the attractive interaction at larger distances. A 
typical shape is shown in Fig. 5.7. Up to the so called ‘hard-core’ radius o, 
the potential is infinite, and outside this radius it is weakly negative. Thus 
the shape of f(r) as shown in Fig. 5.7 results. 


If we can now assume that in the region of the negative potential, ve) < 
1, then we find for the function in (5.3.12) 
—1 |x| <o 
= : 5.3.13 
f(x) _ u(x) ele ( ) 
kT = 
From this, we obtain the second virial coefficient: 
B(T) = -| “73 44m f de r?(—-v(r))/kT] = b- 2 (5.3.14) 
oars beer aw | dr r*(—v(r = oP? 3. 
where 
2 4 
b= 30 =4 > r (5.3.15a) 


denotes the fourfold molecular volume. For hard spheres of radius ro, @ = 2ro 
and 


Co 


1 
a= 2m f ar ru(r) = -5 | ae u(x)O(r —o). (5.3.15b) 
The result (5.3.14) for B(T) is drawn in Fig. 5.8. In fact, B(T) decreases again 
at higher temperatures, since the potential in Nature, unlike the artificial case 
of infinitely hard spheres, is not infinitely high (see Fig. 5.9). 


Remark: From the experimental determination of the temperature depen- 
dence of the virial coefficients, we can gain information about the potential. 
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0 
-1 
Fig. 5.7. A typical pair potential v(r) Fig. 5.8. The second virial coefficient 
(solid curve) and the associated f(r) from the approximate relation (5.3.14) 
(dashed). 
Examples: 
Lennard—Jones potential ((12-6)-potential): 
g\ 12 a\6 
u(r) = 4e (2) = (<) | : (5.3.16) 
r r 


exp-6-Potential : 


a-r o2\8 
= ee ace . 5.3.17 
u(r) =e fexp (*) - (2)" (5.3.17) 
The exp-6-potential is a special case of the so called Buckingham potential, 
which also contains a term « —r7®. 


5.3.2.2 The Lennard—Jones Potential 


We will now discuss the second virial coefficient in the case of a Lennard-— 
Jones potential 


oy =e[()"-(9)]. 


It proves expedient to introduce the dimensionless variables r* = r/o and 
T* =kT/e. is (5.3.12) by parts yields 


6 
B(T) = oe far tage? [Se a =| e@l=r—-sl, (5.3.18) 
* r* 
Expansion of the factor exp (747) in terms of os leads to 
zz gI-8/2 fe] , 
B = o pe (25+1)/4 
r ete) 


(5.3.19) 


Qn i 2.56 0.87 | 
= 3 Oo 


Txi/4 Tx3/4 Tx5/4 
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f 


bk 
hf 


“—— quantum theory for H. 


B* =3B/2xLe* 


tae! 
j / ~~ classical theory 


Fig. 5.9. The reduced second virial coefficient B* = 3B/2nLo® for the Lennard— 
Jones potential. L denotes the Loschmidt number (Avagadro’s number, L = 
6.0221367 - 10*°mol~'); after Hirschfelder et al.® and R.J. Lunbeck, Dissertation, 
Amsterdam 1950 


(see Hirschfelder et al.® Eq. (3.63)); the series converges quickly at large T*. 
In Fig. 5.9, the reduced second virial coefficient is shown as a function of T™. 


Remarks: 


(i) The agreement for the noble gases Ne, Ar, Kr, Xe after adjustment of o 
and ¢€ is good. 

(ii) At T* > 100, the decrease in B(T) is experimentally somewhat greater 
than predicted by the Lennard—Jones interaction (i.e. the repulsion is 
weaker). 

(iii) An improved fit to the experimental values is obtained with the exp-6- 
potential (5.3.17). 

(iv) The possibility of representing the second virial coefficients for classical 
gases in a unified form by introducing dimensionless quantities is an 
expression of the so called law of corresponding states (see Sect. 5.4.3). 


5.3.3 Quantum Corrections to the Virial Coefficients 


The quantum-mechanical expression for the second virial coefficient B(T) 
is given by (5.3.7), where the partition functions occurring there are to be 
computed quantum mechanically. The quantum corrections to B(T) and the 


° T. O. Hirschfelder, Ch. F. Curtiss and R. B. Bird, Molecular Theory of Gases and 
Liquids, John Wiley and Sons, Inc., New York 1954 
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other virial coefficients are of two kinds: There are corrections which result 
from statistics (Bose or Fermi statistics). In addition, there are corrections 
which arise from the non-commutativity of quantum mechanical observables. 
The corrections due to statistics are of the order of 

3 bosons 


epee ues 3 
BS 95/2 ~ h for fermions ’ cise) 


as one can see from Sect. 4.2 or Eq. (B.43). The interaction quantum correc- 
tions, according to Eq. (B.46), take the form 


h2 


Sim REP (5.3.21) 


Bigie / dy eT (July)? 


and are thus of the order of h?. The lowest-order correction given in (5.3.21) 
results from the non-commutativity of p* and v(x). 

We show in Appendix B.33 that the second virial coefficient can be related 
to the time which the colliding particles spend within their mutual potential. 
The shorter this time, the more closely the gas obeys the classical equation 
of state for an ideal gas. 


5.4 The Van der Waals Equation of State 


5.4.1 Derivation 


We now turn to the derivation of the equation of state of a classical, real 
(i.e. interacting) gas. We assume that the interactions of the gas atoms 
(molecules) consist only of a two-particle potential, which can be decom- 
posed into a hard-core (H.C.) part, vu.c.(y) for ly| < o, and an attractive 
part, w(y) (see Fig. 5.7): 


v(y) = va.c.(y) + u(y) - (5.4.1) 


The expression “hard core” means that the gas molecules repel each other at 
short distances like impenetrable hard spheres, which is in fact approximately 
the case in Nature. 

Our task is now to determine the partition function, for which after car- 
rying out the integrations over momenta we obtain 


1 
Z(T,V,N) = srt | Pn [oy en Vics VO—*V/ET (5.4.9) 


We still have to compute the configurational part. This can of course not 
be carried out exactly, but instead contains some intuitive approximations. 
Let us first ignore the attractive interaction and consider only the hard-core 
potential. This yields in the partition function for many particles: 
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fan Pay em Lica 0. VEIT (YY) . (5.4.3) 


This result can be made plausible as follows: if the hard-core radius were zero, 
o = 0, then the integration in (5.4.3) would give simply V; for a finite a, 
each particle has only V — Vo available, where Vo is the volume occupied by 
the other N — 1 particles. This is not exact, since the size of the free volume 
(V — Vo) depends on the configuration, as can be seen from Fig. 5.10. In 
(5.4.3), Vo is to be understood the occupied volume for typical configurations 
which have a large statistical weight. Then, one can imagine carrying out the 
integrations in (5.4.3) successively, obtaining a factor V — Vo for each particle. 
Referring to Fig. 5.10, we can find the following bounds for Vo with a particle num- 
ber N: the smallest Vo is obtained for spherical closest packing, V@"" = 4/2 rgN = 
5.65 rN. The largest Vo is found when the spheres of radius 2ro do not overlap, 
ie. Vor* = 8 rgN = 33.51 rgN. The actual Vo will lie between these extremes 
and can be determined as below from the comparison with the virial expansion, 
namely Vo = bN = 442 rgN = 16.75 rgN. 
Using (5.4.3), we can cast the partition function (5.4.2) in the form 


(V ca Vo)™ f Px, ae f @xn e7 H.C. 7 ics W(Ki— xy) /kT 
ABN N! f Ba,... f Bay eH. : 


2(T,V,N) = 


(5.4.4) 


Here, H.C. stands for the sum of all contributions from the hard-core poten- 
tial divided by kT’. The second fraction can be interpreted as the average 
of exp{ — ic W(Ki — x;)/ Kr in a gas which experiences only hard-core 
interactions. Before we treat this in more detail, we want to consider the 
second exponent more closely. For potentials whose range is much greater 
than o and the distance between particles, it follows approximately that the 
potential acting on 7 due to the other particles, 


Fig. 5.10. Two configurations of three atoms within the volume V. In the first 
configuration, Vo is larger than in the second. The center of gravity of an additional 
atom must be located outside the dashed circles. In the second configuration (closer 
packing), there will be more space for an additional atom (spheres of radius ro are 
represented by solid circles, spheres of radius o = 2ro by dashed circles) 
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igs WiKi — xj) & (N-1 Sf 2 w(x), i.e. the sum over all pairs 


1 1 Ses eee 

w(x —x,;)= ope w(x xj) © 5N(N 1)o® 5 Nw (5.4.5a) 

t<Jj 4 iA9 
with 

1 2 

w= z fee w(x) = aa A (5.4.5b) 

Thus we find for the partition function 
(V=Voy¥ MRD 9 _ (V— Vo) Pe 
Z(T,V,N)= ayn ® 2 ee = apie ee % (5.4.6) 


In this calculation, the attractive part of the potential was replaced by its av- 
erage value. Here, as in the molecular field theory for ferromagnetism which 
will be treated in the next chapter, we are using an “average-potential ap- 
proximation”. 

Before we discuss the thermodynamic consequences of (5.4.6), we return 
once more to (5.4.4) and the note which followed it. The last factor can be 
written using a cumulant expansion, Eq. (1.2.16’), in the form 


(e- ic; eS = ert -( w(x — x;)/KT) 


a H.C. 
+5 (((Sweu - 5/47) ) — (So wiles —x5)/AL) Jef 
. - ee 54.7) 


The average values ( ),; ¢. are taken with respect to the canonical distribution 


function of the total hard-core potential. Therefore, on <j W(X — x;)) 
H.C. 
refers to the average of the attractive potential in the “free” volume allowed 


by the interaction of hard spheres. Under the assumption made earlier that 
the range is much greater than the hard-core radius o and the particle dis- 
tance, we again find (5.4.5a,b) and (5.4.6). The second term in the cumulant 
series (5.4.7) represents the mean square deviation of the attractive inter- 
actions. The higher the temperature, the more dominant the term w/kT 
becomes. 

From (5.4.6), using N! ~ NNe~ \/2rN, we obtain the free energy, 


e(V—Vo) Na 
F=—kTN\ A. 
o8 Say v (5.4.8) 
the pressure (the thermal equation of state), 
OF kT N N?a 
roe es a 5.4.9 
oe Vy VP’ ee 
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and, with E = —T? (4) vn» the internal energy (caloric equation of state), 
3 Na 
E = -—NkT —- —. 5.4.10 
5 V ( ) 
Finally, we can relate Vo to the second virial coefficient. To do this, we expand 
(5.4.9) in terms of 1/V and identify the result with the virial expansion (5.3.6) 


and (5.3.14): 


kTN Vo aN kTN a\WN 
P= 14+ fi... = 14 
V V kTV | V | (6 i) V | 
From this, we obtain 
Vo = Nb, (5.4.11) 


where 6 is the contribution to the second virial coefficient which results from 
the repulsive part of the potential. Inserting in (5.4.9), we find 


pee, (5.4.12) 


where on the right-hand side, the specific volume v = V/N was introduced. 
Equation (5.4.12) or equivalently (5.4.9) is the van der Waals equation of 
state for real gases,” and (5.4.10) is the associated caloric equation of state. 


Remarks: 


(i) The van der Waals equation (5.4.12) has, in comparison to the ideal gas 
equation P = kT/v, the following properties: the volume v is replaced 
by uv — 8, the free volume. For v = b, the pressure would become infinite. 
This modification with respect to the ideal gas is caused by the repulsive 
part of the potential. 

(ii) The attractive interaction causes a reduction in the pressure via the 
term —a/v*. This reduction becomes relatively more important as the 
temperature is lowered. 


(iii) We make another comparison of the van der Waals equation to the ideal gas 
equation by writing (5.4.12) in the form 


(P+ 5) @-b)=er. 


Compared to Pv = kT, the specific volume v has been decreased by b, because 
the molecules are not pointlike, but instead occupy their own finite volumes. 
The mutual attraction of the molecules leads at a given pressure to a reduction 
of the volume; it thus acts like an additional pressure term. One can also readily 
understand the proportionality of this term to 1/ vu’. If one considers the surface 
layer of a liquid, it experiences a kind of attractive force from the deeper-lying 
layers, which must be proportional to the square of the density, since if the 
density were increased, the number of molecules in each layer would increase 
in proportion to the density, and the attractive force per unit area would thus 
increase proportionally to 1/ vu. 


” Johannes Dietrich van der Waals, 1837-1923: equation of state formulated 1873, 
Nobel prize 1910 
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The combined action of the two terms in the van der Waals equation 
results in qualitatively different shapes for the isotherms at low (71,72) and 
at high (73,74) temperatures. The family of van der Waals isotherms is shown 
in Fig. 5.11. For T > Ti, the isotherms are monotonic, while for T < Ty, they 
are S-shaped; the significance of this will be discussed below. 


220 
P 
1.5 
Pe 
5 ae 
0 Fig. 5.11. The van der Waals 
O vw 2 4 6 v 8 isotherms in dimensionless units 


P/P, and v/ve 


We see immediately that on the so called critical isotherm, there is a critical 


point, at which the first and second derivatives vanish, i.e. a horizontal point 
of inflection. The critical point T., P., V- thus follows from ae a eas = 0. 


This leads to the two conditions eT gs fe 2 Oe SO he Tom 
(v—b) v (v—b) 
which the values 


8a a 
e= 380, kTo= Soy Pe =a Al 
oe 27 b 27h (1) 
are obtained. The dimensionless ratio 
kT, 8 : 
=-~=2.6 5.4.14 
P.ve 3 ( ) 


follows from this. The experimental value is found to be somewhat larger. 


Note: It is apparent even from the derivation that the van der Waals equation can 
have only approximate validity. This is true of both the reduction of the repulsion 
effects to an effective molecular volume b, and of the replacement of the attractive 
(negative) part of the potential by its average value. The latter approximation 
improves as the range of the interactions increases. In the derivation, correlation 
effects were neglected, which is questionable especially in the neighborhood of the 
critical point, where strong density fluctuations will occur (see below). However, the 
van der Waals equation, in part with empirically modified van der Waals constants 
a and 6}, is able to give a qualitative description of condensation and of the behavior 
in the neighborhood of the critical point. There are numerous variations on the van 
der Waals equation; e.g. Clausius suggested the equation 
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3 Fig. 5.12. Isotherms for carbonic acid ob- 

g tained from Clausius’ equation of state. 
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The plot of its isotherms shown in Fig. 5.12 is similar to that obtained from the 
van der Waals theory. 


5.4.2 The Maxwell Construction 


At temperatures below T,, the van der Waals isotherms have a typical S- 
shape (Fig. 5.12). The regions in which (OP/OV)r > 0, i.e. the free energy is 
not convex and therefore the stability criterion (3.6.48b) is not obeyed, are 
particularly disturbing. The equation of state definitely requires modification 
in these regions. We now wish to consider the free energy within the van 
der Waals theory. As we finally shall see, an inhomogeneous state containing 
liquid and gaseous phases has a lower free energy. In Fig. 5.13, a van der Waals 
isotherm and below it the associated free energy f(T,v) = F(T,V)/N are 
plotted. Although the lower figure can be directly read off from Eq. (5.4.8), 
it is instructive and useful for further discussion to determine the typical 
shape of the specific free energy from the isotherms P(T,v) by integration of 


P=-— (¥).. over volume: 


f(E,v) = f(Py00) — f de! PD) (5.4.15) 
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Fig. 5.13. A van der Waals isotherm and 
the corresponding free energy in the di- 
mensionless units P/P., v/vc and f/kTe. 
t(T.v)=f(T.v9)- ] dv'P(T.v’) The free energy of the heterogeneous state 

= (dashed) is lower than the van der Waals 
free energy (solid curve) 


The integration is carried out from an arbitrary initial value vg of the specific 
volume up to v. We now draw in a horizontal line intersecting the van der 
Waals isotherm in such a way that the two shaded areas are equal. The 
pressure which corresponds to this line is denoted by Po. This construction 
yields the two volume values v, and v2. The values of the free energy at the 
volumes 11,2 will be denoted by f1,2 = f(T, v1,2). At the volumes v; and vo, 
the pressure assumes the value Py and therefore the slope of f(T, v) at these 
points has the value —Po. As a reference for the graphical determination 
of the free energy, we draw a straight line through (v, f1) with its slope 
equal to —Pp (shown as a dashed line). If the pressure had the value P5 
throughout the whole interval between v1 and v2, then the free energy would 
be fi; — Po(v — v1). We can now readily see that the free energy which is 
shown in Fig. 5.13 follows from P(T,v), since the van der Waals isotherm 
to the right of v; initially falls below the horizontal line P = Py. Thus the 
negative integral, i.e. the free energy which corresponds to the van der Waals 
isotherm, lies above the dashed line. Only when the volume v2 has been 
reached is fo = f(T, v2) = fi — Po(v2 — v1), owing to the equal areas which 
were presupposed in drawing the horizontal line, and the two curves meet 
again. Due to Py = ut a —of py? the (dashed) line with slope —Pp is 
precisely the double tangent to the curve f(T,v). Since P > Po for v < vy 
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and P < Py for v > v2, f in these regions also lies above the double tangent. 
In Fig. 5.13 we can see that the free energy calculated in the van der Waals 


theory is not convex everywhere, (0 > rf = et = 1); this violates the 


thermodynamic inequality (3.3.5). 
For comparison, we next consider a two-phase, heterogeneous system, 


whose entire material content is divided into a fraction c; = ~2=> in the 
state (v1, 7) and a fraction cp = 2" in the state (v2,T). These states have 


V2—-V1 
the same pressure and temperature and can exist in mutual equilibrium. Since 


the free energy of this inhomogeneous state is given by the linear combination 
afit+ cafe of f; and fs, it lies on the dashed line.? Thus, the free energy of 
this inhomogeneous state is lower than that from the van der Waals theory. 
In the interval [v1, v2] (two-phase region), the substance divides into two 
phases, the liquid phase with temperature and volume (T,v1), and the gas 
phase with (7, v2). The pressure in this interval is Po. The real isotherm is 
obtained from the van der Waals isotherm by replacing the S-shaped portion 
by the horizontal line at P = Py, which divides the area equally. Outside the 
interval [v1, v2], the van der Waals isotherm is unchanged. This construction 
of the equation of state from the van der Waals theory is called the Maxwell 
construction. The values of v1 and vg depend on the temperature of the 
isotherm considered, i.e. vy = v1(T’) and vg = v2(T). As T approaches T., 
the interval [v1(T),v2(L)] becomes smaller and smaller; as the temperature 
decreases below T’,, the interval becomes larger. Correspondingly, the pressure 
Po(T) increases or decreases. In Fig. 5.14, the Maxwell construction for a 
family of van der Waals isotherms is shown. The points (Po(T),v1(T)) and 


~2 


-3 


-4 


Fig. 5.14. Van der Waals isotherms, showing the Maxwell construction and the 
resulting coexistence curve (heavy curve) in the dimensionless units P/P. and v/vc, 
as well as the free energy f 


8 


ata =l, uate =v,afitefe =afi + ce(fi — Po(v2 —v1)) = 
fi — Po(v — v1). 
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(Po(T), v2(L)) form the liquid branch and the gas branch of the coexistence 
curve (heavy curves in Fig. 5.14). The region within the coexistence curve is 
called the coexistence region or two-phase region. In this region the isotherms 
are horizontal, the state is heterogeneous, and it consists of both the liquid 
and gaseous phases from the two limiting points of the coexistence region. 


Remarks: 


(i) In Fig. 5.15, the PVT-surface which follows from the Maxwell construc- 
tion is shown schematically. The van der Waals equation of state and the 
conclusions which can be drawn from it are in accord with the general 
considerations concerning the liquid-gas phase transition in the frame- 
work of thermodynamics which we gave in Sect. 3.8.1. 


Fig. 5.15. The surface 
of the equation of state 
from the van der Waals 
theory with the Maxwell 
equal-area construction 
(schematic). Along with 
three isotherms at tem- 
peratures 7; < T. < To, 
the coexistence curve 
(surface) and its projec- 
tion on the T-V plane are 
v shown 


P(v), T=const. 


(ii) The chemical potentials » = f + Pv of the two coexisting liquid and 
gaseous phases are equal. 

(iii) Kac, Uhlenbeck and Hemmer’ calculated the partition function exactly for a 
one-dimensional model with an infinite-range potential 


fore) |a| < xo 
v(x) = eee freee and kt — 0. 


The result is an equation of state which is qualitatively the same as in the 
van der Waals theory. In the coexistence region, instead of the S-shaped curve, 
horizontal isotherms are found immediately. 

(iv) A derivation of the van der Waals equation for long-range potentials akin to 
L.S. Ornstein’s, in which the volume is divided up into cells and the most 
probable occupation number in each cell is calculated, was given by van Kam- 
pen’®. The homogeneous and heterogeneous stable states were found. Within 
the coexistence region, the heterogeneous states — which are described by the 
horizontal line in the Maxwell construction — are absolutely stable. The two 
homogeneous states, represented by the S-shaped van der Waals isotherms, are 
metastable, as long as oP < 0, and describe the superheated liquid and the 
supercooled vapor. 


° M. Kac, G.E. Uhlenbeck and P.C. Hemmer, J. Math. Phys. 4, 216 (1963) 
10 N.G. van Kampen, Phys. Rev. 135, A362 (1964) 
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5.4.3 The Law of Corresponding States 


If one divides the van der Waals equation by P. = s7g2 and uses the reduced 
variables P* = 4 ,Ve= 2, = - , then a dimensionless form of the 
equation is obtained: 

8T* 3 


In these units, the equation of state is the same for all substances. Substances 

with the same P*, V* and thus T* are in corresponding states. Eq. (5.4.16) 

is called the “law of corresponding states”; it can also be cast in the form 
eee... 8 3P* T* 


KO i PEs Ps mp) ALT*x ¢ 
T 3 Te (Pty fh EME 


This means that P*V*/T™* as a function of P* yields a family of curves with 
the parameter 7*. All the data from a variety of liquids at fixed T* lie on a 
single curve (Fig. 5.16). This holds even beyond the validity range of the van 
der Waals equation. Experiments show that liquids behave similarly when 
P, V and T are measured in units of P., V. and T,. This is illustrated for a 
series of different substances in Fig. 5.16. 
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Fig. 5.16. The law of corresponding states.'? 
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5.4.4 The Vicinity of the Critical Point 


We now want to discuss the van der Waals equation in the vicinity of its 
critical point. To do this, we write the results in a form which makes the 


) G.J. Su, Ind. Engng. Chem. analyt. Edn. 38, 803 (1946) 
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analogy to other phase transitions transparent. The usefulness of this form 
will become completely clear in connection with the treatment of ferromag- 
nets in the next chapter. The equation of state in the neighborhood of the 
critical point can be obtained by introducing the variables 


AP=P-—P,, Av=v-u, AT=T-T, (5.4.17) 
and expanding the van der Waals equation (5.4.12) in terms of Av and AT: 
p= k(T. + AT) a 


2b + Av (3b + Av)? 
Seg ay) ae) ae) 
api 2p aap) ACG) 8) P) 


From this expansion, we find the equation of state in the immediate neigh- 
borhood of its critical point!” 


AP* =4 AT* — 6 AT* Av* — : (Av*)P +... 3 (5.4.18) 


it is in this approximation antisymmetric with respect to Av*, see Fig. 5.17. 


(ve, Te; Pz) 


Fig. 5.17. The coexistence curve in the vicinity of the critical point. Due to the 
term 4 AT™* in the equation of state (5.4.18), the coexistence region is inclined with 
respect to the V-T plane. The isotherm shown is already so far from the critical 
point that it is no longer strictly antisymmetric 


' The term AT(Av)? and especially higher-order terms can be neglected in the 
leading calculation of the coexistence curve, since it is effectively of order (AT)? 
in comparison to ~ (AT)*/ ? for the terms which were taken into account. The 
corrections to the leading critical behavior will be summarized at the end of 
this section. In Eq. (5.4.18), for clarity we use the reduced variables defined just 
before Eq. (5.4.16): AP* = AP/P,; etc. 


5.4 The Van der Waals Equation of State 253 


The Vapor-Pressure Curve: We obtain the vapor-pressure curve by pro- 
jecting the coexistence region onto the P-T plane. Owing to the antisymmetry 
of the van der Waals isotherms with respect to Av* in the neighborhood of 
T. (Eq. 5.4.18), we can easily determine the location of the two-phase region 
by setting Av* = 0 (cf. Fig. 5.17), 


AP* =4 AT". (5.4.19) 


The Coexistence Curve: 

The coexistence curve is the projection of the coexistence region onto the V-T 
plane. Inserting (5.4.19) into (5.4.18), we obtain the equation 0 = 6 AT* Av*+ 
3/2 (Av*)? with the solutions 


Avé = —Avf, = V/4(—AT*) + O(AT*) (5.4.20) 


for T < T,. For T < JT, the substance can no longer occur with a single 
density, but instead splits up into a less dense gaseous phase and a denser 
liquid phase (cf. Sect. 3.8). Avé and Avy represent the two values of the 
order parameter for this phase transition (see Chap. 7). 


The Specific Heat: 


T > T. : From Eq. (5.4.10), the internal energy is found to be E = 3NkT — 
N2 

a, : 

Therefore, the specific heat at constant volume outside the coexistence region 

is 


Cy = =Nk, (5.4.21a) 


as for an ideal gas. We now imagine that we can cool a system with precisely 
the critical density. Above T, it has the homogeneous density 1/v,., while 


below T, it divides into the two fractions (as in (5.4.20)) eg = 72=k and 
cL = ar with a gaseous phase and a liquid phase. T < T,: below T,, the 


internal energy is given by 


ae SkT (Z *) ep Ve EUS AUD. 51) 


N va UL 2 : (ve + Ave) (ve + Avy) 


If we insert (5.4.20), or, anticipating later results, (5.4.29),'3 we obtain 


= ets a 9 56a (T-T.\” 5 
B= N(5Ar at ght T) +3 ( = ) +0((an) ) 


13 With (5.4.20), one finds only the jump in the specific heat; in order to determine 
the linear term in (5.4.21b) as well, one must continue the expansion of vg and 
Avz, Eq. (5.4.27). Including these higher terms, the coexistence curve is not 
symmetric. 
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Fig. 5.18. The specific heat in the neigh- 
borhood of the critical point of the van der 
Waals liquid 


The specific heat 


28 T — T. 
25 Ty 


Oy = “Nk Nk (1 =) for T < T, (5.4.21c) 


exhibits a discontinuity (see Fig. 5.18). 

The Critical Isotherm: 

In order to determine the critical isotherm, we set AT* = 0 in (5.4.18). The 
critical isotherm 


APS —5(Av")? (5.4.22) 


is a parabola of third order; it passes through the critical point horizontally, 
which implies divergence of the isothermal compressibility. 


The Compressibility: 


To calculate the isothermal compressibility Kr = -> (Sb) we determine 
oP* 9 

N = —6 AT* Av")? 9.4.23 

(s) ; (40) (5.4.23) 


from the van der Waals equation (5.4.18). For T > T,, we find along the 
critical isochores (Av* = 0) 


1 1 T. 1 
= = z 4,24 
“? 6P, AT*  6P, AT eae) 
For T < Ty, along the coexistence curve (i.e. Av* = Avé = —Avf), using 


Eq. (5.4.20), we obtain the result N (35), = —6 AT*—9 (Aut)? = 24 AT*, 
that is 


(5.4.24b) 
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The isothermal compressibility diverges in the van der Waals theory above 
and below the critical temperature as (JT — T,)~!. The accompanying long- 
range density fluctuations lead to an increase in light scattering in the forward 
direction (critical opalescence; see (9.4.51)). 


Summary: 

Comparison with experiments shows that liquids in the neighborhood of 
their critical points exhibit singular behavior, similar to the results described 
above. The coexistence line obeys a power law; however the exponent is not 
1/2, but instead @ ~ 0.326; the specific heat is in fact divergent, and is char- 
acterized by a critical exponent a. The critical isotherm obeys AP ~ Av? 
and the isothermal compressibility is hr ~ |T — T,|~7. Table 5.2 contains 
a summary of the results of the van der Waals theory and the power laws 
which are in general observed in Nature. The exponents 3, a, 6, and ¥ are 
called critical exponents. The specific heat shows a discontinuity according 
to the van der Waals theory, as shown in Fig. 5.18. It is thus of the order of 
(T — T.)° just to the left and to the right of the transition. The index d of 
the exponent 0 in Table 5.2 refers to this discontinuity. Compare Eq. (7.1.1). 


Table 5.2. Critical Behavior according to the van der Waals Theory 


Physical van der Waals Critical Temperature 
quantity behavior range 

Ave =—-Av, ~(Te-T)? = (T —T)? Tet, 
cv ~(T—T.)% |%.-—T\7° T2T. 
AP ~ (Av)? (Av)? T=T. 
Kr Y|P=T|\- [P= Try TST: 


The Latent Heat Finally, we will determine the latent heat just below 
the critical temperature. The latent heat can be written using the Clausius— 
Clapeyron equation (3.8.8) in the form: 


poPo 
lor 


poPo 


——(vG = UL) = Tor —— (Ave a Avy) . 


q=T(sa-st) = 


Here, sq and sz refer to the entropies per particle of the gas and liquid 
phases and oi is the slope of the vaporization curve at the corresponding 
point. In the vicinity of the critical point, to leading order we can set Tof oR 
Te oral 6S where (OPo/OT)c.p. is the slope of the evaporation curve at ie 


critical potik 


q = 2T. (=) Ave . (5.4.25) 
c.p. 
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The slope of the vapor-pressure curve at T. is finite (cf. Fig. 5.17 and 
Eq. (5.4.19)). Thus the latent heat decreases on approaching T, according 
to the same power law as the order parameter, i.e. g x (T. — T)%; in the van 
der Waals theory, 3 = $. 

By means of the thermodynamic relation (3.2.24) Cp—Cy = -T Gey / 
(So) we can also determine the critical behavior of the specific heat at 


constant pressure. Since ($F), is finite, the right-hand side behaves like the 
isothermal compressibility «rp, and because Cy is only discontinuous or at 


most weakly singular, it follows in general that 

Cper~kr x(T-T.) 7; (5.4.26) 
for a van der Waals liquid, 7 = 1. 
“Higher-Order Corrections to Eq. (5.4.18) 


For clarity, we use the reduced quantities defined in (5.4.16). Then the van der 
Waals equation becomes 


AP* = 4AT* — 6AT* Av” + 9AT* (Av*)? — (G 3 Zar’) (Av*)? 
(J + Sars) (a0) + (= + ** ar") (Av")® + 0((40")°) . 


(5.4.27) 


The coexistence curve Avé yy and the vapor-pressure curve, which we denote here 
by APj(AT*), are found from the van der Waals equation: 


AP*(AT™, Avg) = AP*(AT*, Avz) = 0 
with the Maxwell construction 
Avg 
/ d(Av") (AP* — APS(AT*)) =0. 
Avy, 
For the vapor-pressure curve in the van der Waals theory, we obtain 


APs =4AT* + = (-ar")? +0((-ar*)*/?) , (5.4.28) 


and for the coexistence curve: 


Avg = 2V—AT* 4 = ( ATV AX (AT) LO (Cary) 


18 ; (5.4.29) 
Avi, = —2V—AT* + = (-AT*) + ¥(-AT*)*? + O((AT*)’) 
(with X —Y = a see problem 5.6). In contrast to the ferromagnetic phase 


transition, the order parameter is not exactly symmetric; instead, it is symmetric 
only near T., compare Eq. (5.4.20). 
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The internal energy is: 


E 3 a » 06 *\2 *)5/2 
= = Shr “(1 4AT* — = (AT*) + O(|AT" )) (5.4.30) 


and the heat capacity is: 


_ 3 9 28 * *)3/2 
Cy = SNk+ SNa(a sIAr |+0(|Ar | )) (5.4.31) 


For the calculation of the specific heat, only the difference X — Y = 294/25 enters. 
The vapor-pressure curve is no longer linear in AT”, and the coexistence curve is 
no longer symmetric with respect to the critical volume. 


5.5 Dilute Solutions 


5.5.1 The Partition Function and the Chemical Potentials 


We consider a solution where the solvent consists of N particles and the 
solute of N’ atoms (molecules), so that the concentration is given by 


N' 
=—<«<l. 
c N < 
We shall calculate the properties of such a solution by employing the grand 
partition function!* 


Ze(T,V, uu) = So Zu (TV, wz” 
n’=0 


— Zo(T,V,u) +2’ Zi (T, V, p) + O(2'”) # (5.5.1) 


It depends upon the chemical potentials of the solvent, jz, and of the solute, 
uw’. Since the solute is present only at a very low concentration, we have 
uw’ <0 and therefore the fugacity 2! = e’/*T < 1. In (5.5.1), Zo(T, V, 1) 
means the grand partition function of the pure solvent and Z;(T, V, jz) that 
of the solvent and a dissolved molecule. 

From these expressions we find for the total pressure 


P kT 
P= 7 ae: log Za = volT, pw) + 2’ yi (T, pw) + O(2'*) ; (5.5.2) 
where yo = —F log Zo and y, = —P 2. In (5.5.2), yo(T, 4) is the con- 


tribution of the pure solvent and the second term is the correction due to 


4 Here, Zy(T, V,u) = or Trn Tre Blin tH t Wat HM) where Tr,, and Tr,’ 
refer to the traces over n- and n/-particle states of the solvent and the solute, 
respectively. The Hamiltonians of these subsystems and their interactions are 
denoted by H,, Hi, and Wyn. 


258 5. Real Gases, Liquids, and Solutions 


the dissolved solute. Here, Z; and therefore y, depend on the interactions 
of the dissolved molecules with the solvent, but not however on the mutual 
interactions of the dissolved molecules. We shall now express the chemical 
potential yz in terms of the pressure. To this end, we use the inverse function 
yo’ at fixed T, ie. yo '(T, yo(T, u)) = pw, obtaining 


w= —p (T,-P — zy (T, 1) 


:: T, po (T,—-P 5.5.3 
= oT, P) 7 aig fo ( )) + O(2z'7) ; ( ) 
On B= (T,-P) 
The (mean) particle numbers are 
Of Opo(T 
N= = yPlP A) 5 ayy (5.5.4a) 
Ou Ou 
Of ZV 2 
P= =~ _y (T. ire .5.4b 
ay rp Pu M+ O(zZ) (5.5.4b) 
Inserting this into (5.5.3), we finally obtain 
wT, ge i) _ HoT, P) —kT c+ O(c?) ’ (5.5.5) 


where pio(T, P) = yp ‘(T, —P) is the chemical potential of the pure solvent as 
a function of T and P. From (5.5.4b) and (5.5.4a), we find for the chemical 
potential of the solute: 


—N'kT 
bw =kTlogz'’ = kT log (Far) + O(z’") 


VeoulT, p) 5.5.6 
peng! ee eel nee 
Neu (T, 1) 
and finally, using (5.5.5), 
u(T, P,c) = kTloge+g(T, P) + O(c). (5.5.7) 


In the function g(T,P) = kT log(kT/vo(T, P)yi(T, wo(T, P))), which de- 
pends only on the thermodynamic variables T and P, the interactions of 
the dissolved molecules with the solvent also enter. 

The simple dependences of the chemical potentials on the concentration 
are valid so long as one chooses T and P as independent variables. From 
(5.5.5), we can calculate the pressure as a function of T and yu. To do this, we 
use Po(T, 4), the inverse function of o(T, P), and rewrite (5.5.5) as follows: 


b= bo (T, Po(T, LL) a (P _ Po(T, L))) = Khies 


we then expand in terms of P—P (T, jz) and use the fact that uo(T, Po(T, w)) = 
L holds for the pure solvent: 
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a: JLo 
w= pt (FB) (P= Porn) kre 


From the Gibbs-Duhem relation, we know that (3) = u(P,T) =v+ 
i 
O(c”), from which it follows that 


P=P(T,p)+ “kT + O(c?) , (5.5.8) 


where v is the specific volume of the solvent. The interactions of the dissolved 
atoms with the solvent do not enter into P(T, uw, c) and p(T, P,c) to the order 
we are considering, although we have not made any constraining assumptions 
about the nature of the interactions. 


*“An Alternate Derivation of (5.5.6) and (5.5.7) 

in the Canonical Ensemble 

We again consider a system with two types of particles which are present in the 
amounts (particle numbers) N and N’, where the concentration of the latter type, 


c= x < 1, is very small. The mutual interactions of the dissolved atoms can 
be neglected in dilute solutions. The interaction of the solvent with the solute is 
denoted by Wyn. Furthermore, the solute is treated classically. We initially make 
no assumptions regarding the solvent; in particular, it can be in any phase (solid, 
liquid, gaseous). 

The partition function of the overall system then takes the form 


Z = Tren tn /ET aan —(H) +Wyry)/kT 
; (5.5.9a) 
= (tee =) ayer | Bary . eae i ae ’ 


where ’ is the thermal wavelength of the dissolved substance. Hy and H NP are the 
Hamiltonians for the solvent and the solute molecules, Vj, denotes the interactions 
of the solute molecules, and Wyn is the interaction of the solvent with the solute. 
A configurational contribution also enters into (5.5.9a): 


Zconf = fae aes Carn: (entra /r 


_ f Bai . day Tre En /kT eo Vn t+Wy ny) /kT 


Tre~Hn/kT 


(5.5.9b) 


The trace runs over all the degrees of freedom of the solvent. When the latter must 
be treated quantum-mechanically, Wyn also contains an additional contribution 
due to the nonvanishing commutator of Hy and the interactions. Vy, depends on 
the {x’} and Wyy on the {x’} and {x} (coordinates of the solute molecules and 
the solvent). We assume that the interactions are short-ranged; then Vy; can be 
neglected for all the typical configurations of the dissolved solute molecules: 


Ce ge) ES Ca, 


& on (Www) /BE +3 (Wary (Ww) ))/ (RLY? E.-- 


= sy, (Wei ETA ((W2, 5 (Wainy?))#-) 


a eA (5.5.9¢) 
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Here, W,,n denotes the interaction of molecule n’ with the N molecules of the 
solvent. In Eq. (5.5.9c), a cumulant expansion was carried out and we have taken 
into account that the overlap of the interactions of different molecules vanishes for 
all of the typical configurations. Owing to translational invariance, the expectation 
values (W,,v) etc. are furthermore independent of x’ and are the same for all n’. We 
thus find for each of the dissolved molecules a factor e~ Y(7:”/% ), where ~ depends 
on the temperature and the specific volume of the solvent. It follows from (5.5.9c) 
that the partition function (5.5.9a) is 


Z = (Tre HN /AE 1 Vv 7 
= ( re ) a (5aVEVIN)) (5.5.10) 
This result has the following physical meaning: the dissolved molecules behave like 
an ideal gas. They are subject at every point to the same potential from the sur- 
rounding solvent atoms, i.e. they are moving in a position-independent effective 
potential kT'~)(T, V/N), whose value depends on the interactions, the temperature, 
and the density. The free energy therefore assumes the form 


eV 
N’)3 


F(T,V, N,N’) = Fo(T,V, N) — kTN’ log —N'y(T,V/N) , (5.5.11) 
where Fo(T,V) = —kT log Tr e 4n/*T is the free energy of the pure solvent and 
y(T,V/N) = kT logy(T,V/N) is due to the interactions of the dissolved atoms 
with the solvent. From (5.5.11), we find for the pressure 


Ud 
r= (28), = err (ey 
— F sk (5.5.12) 
c 
= Po(T2) + ee (Sotte)) 
where c= x and v = x were employed. 


We could calculate the chemical potentials from (5.5.11) as functions of T and 
v. In practice, however, one is usually dealing with physical conditions which fix the 
pressure instead of the specific volume. In order to obtain the chemical potentials 
as functions of the pressure, it is expedient to use the free enthalpy (Gibbs free 
energy). It is found from (5.5.11) and (5.5.12) to be 


' eV . oy 
G = F+PV =Go(T, P,N)—-kTN (108 rae 1) N (> vt) , (5.5.13) 


where Po(T,v) and Go(T, P, N) are the corresponding quantities for the pure sol- 
vent. From Equation (5.5.12), one can compute v as a function of P, T and c, 


v=w(T,P)+O(N'/N) . 
If we insert this in (5.5.13), we find an expression for the free enthalpy of the form 


12 


a ) , (5.5.14) 


G(T, P, N,N’) = Go(T, P, N)—kTN' (og = 1) 4N'Q(T, P)+0( 


where g(T, P) = ( kT log xs (y v 33) . Now we can compute the 
v=v9(T,P) 
two chemical potentials as functions of T, P and c. For the chemical potential of the 


solvent, u(T, P,c) = ($8), p.nv the result to leading order in the concentration is 
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u(T, P,c) = po(T, P) — kTe+ O(c?) . (5.5.15) 
For the chemical potential of the solute, we find from (5.5.14) 
wu (T, P,c) = OG = —kT log u + g(T,P)+O(c). (5.5.16) 
ON') y pr c 


The results (5.5.15) and (5.5.16) agree with those found in the framework of the 
grand canonical ensemble (5.5.5) and (5.5.7). 


5.5.2 Osmotic Pressure 


We let two solutions of the same substances (e.g. salt in water) be separated 
by a semipermeable membrane (Fig. 5.19). An example of a semipermeable 
membrane is a cell membrane. 


Fig. 5.19. A membrane which allows only 
the solvent to pass through (= semiperme- 
able) separates the two solutions. - = sol- 
vent, e = solute; concentrations ci and ce 


The semipermeable membrane allows only the solvent to pass through. There- 
fore, in chambers 1 and 2, there will be different concentrations c; and cg. 
In equilibrium, the chemical potentials of the solvent on both sides of the 
membrane are equal, but not those of the solute. The osmotic pressure is 
defined by the pressure difference 


AP=P,-P,. 


From (5.5.8), we can calculate the pressure on both sides of the membrane, 
and since in equilibrium, the chemical potentials of the solvent are equal, 
[41 = Le, it follows that the pressure difference is 


Ci — C2 


AP = RE: (5.5.17) 


UV 


The van’t Hoff formula is obtained as a special case for cp = 0, c; = c, when 
only the pure solvent is present on one side of the membrane: 
N' 
AP = =37 Si (5.5.17) 
v V 


Here, N’ refers to the number of dissolved molecules in chamber 1 and V to 
its volume. 
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Notes: 


(i) Equation (5.5.17’) holds for small concentrations independently of the 
nature of the solvent and the solute. We point out the formal similarity 
between the van’t Hoff formula (5.5.17’) and the ideal gas equation. The 
osmotic pressure of a dilute solution of n moles of the dissolved substance 
is equal to the pressure that n moles of an ideal gas would exert on the 
walls of the overall volume V of solution and solvent. 

(ii) One can gain a physical understanding of the origin of the osmotic pres- 
sure as follows: the concentrated part of the solution has a tendency to 
expand into the less concentrated region, and thus to equalize the con- 
centrations. 

(iii) For an aqueous solution of concentration c = 0.01, the osmotic pressure 
at room temperature amounts to AP = 13.3 bar. 


*5.5.3 Solutions of Hydrogen in Metals (Nb, Pd,...) 


We now apply the results of Sect. 5.5.1 to an important practical example, the 
solution of hydrogen in metals such as Nb, Pd,... (Fig. 5.20). In the gas phase, 
hydrogen occurs in molecular form as Hg, while in metals, it dissociates. We 
thus have a case of chemical equilibrium, see Sect. 3.9.3. 


Fig. 5.20. Solution of hydrogen in metals: atomic 
hydrogen in a metal is represented by a dot, while 
molecular hydrogen in the surrounding gas phase is 
represented by a pair of dots. 


The chemical potential of molecular hydrogen gas is 


4 kT 
bin, = —kT |log Me + log z, = —-kT ow Pe +logZ;| , (5.5.18) 
He He 


where Z; also contains the electronic contribution to the partition function 
(Eq. (5.1.5c)). The chemical potential of atomic hydrogen dissolved in a metal 
is, according to Eq. (5.5.7), given by 


fH = kT loge+g(T,P). (5.5.19) 


The metals mentioned can be used for hydrogen storage. The condition for 
chemical equilibrium (3.9.26) is in this case 2H = uy,; this yields the equi- 
librium concentration: 
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1 
PAR, \? -2 —29(T,P)+e 
— eltlttg /2—g(T,P))/kT _ ( + CH2 —3 T4G\h 4) Tt Eel 
c= ea ( LT ) Z; exp ( OKT ) . (5.5.20) 


Since g(T, P) depends only weakly on P, the concentration of undissolved 
hydrogen is c ~ P2. This dependence is known as Sievert’s law. 


5.5.4 Freezing-Point Depression, Boiling-Point Elevation, 
and Vapor-Pressure Reduction 


Before we turn to a quantitative treatment of freezing-point depression, 
boiling-point elevation, and vapor-pressure reduction, we begin with a qual- 
itative discussion of these phenomena. The free enthalpy of the liquid phase 
of a solution is lowered, according to Eq. (5.5.5), relative to its value in the 
pure solvent, an effect which can be interpreted in terms of an increase in en- 
tropy. The free enthalpies of the solid and gaseous phases remain unchanged. 
In Fig. 5.21, G(T, P) is shown qualitatively as a function of the temperature 
and the pressure, keeping in mind its convexity, and assuming that the dis- 
solved substance is soluble only in the liquid phase. The solid curve describes 
the pure solvent, while the change due to the dissolved substance is described 
by the chain curve. As a rule, the concentration of the solute in the liquid 
phase is largest and the associated entropy increase leads to a reduction of the 
free enthalpy. From these two diagrams, the depression of the freezing point, 
the elevation of the boiling point, and the reduction in the vapor pressure 
can be read off. 


G T=const 
(3), =V 


liquid 
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Fig. 5.21. The change in the free enthalpy on solution of a substance which dis- 
solves to a notable extent only in the liquid phase. The solid curve is for the pure 
solvent, the chain curve for the solution. We can recognize the freezing-point de- 
pression, the boiling-point elevation, and the vapor-pressure reduction 


Next we turn to the analytic treatment of these phenomena. We first 
consider the melting process. The concentrations of the dissolved substance 


264 5. Real Gases, Liquids, and Solutions 


in the liquid and solid phases are cy, and cg.!° The chemical potentials of 
the solvent in the liquid and the solid phase are denoted by p’ and pS, and 
correspondingly in the pure system by pj) and yi. From Eq. (5.5.5), we find 
that 


ph = uh(P,T) — kTe 
and 


p® = u5(P,T) — kT eg . 


S 


d 


In equilibrium, the chemical potentials of the solvent must be equal, pw’ = py 
from which it follows that!® 


pb (P,T) — kT en, = pB(P,T) — kT cg . (5.5.21) 


For the pure solute, we obtain the melting curve, i.e. the relation between 
the melting pressure Pp and the melting temperature To, from 


16 (Po; To) = 46 (Po; To) - (515.22) 


Let (Po, To) be a point on the melting curve of the pure solvent. Then consider 
a point (P,T) on the melting curve which obeys (5.5.21), and which is shifted 
relative to (Po, To) by AP and AT, that is 


P=PeAP,. T= LAr. 


If we expand Eq. (5.5.21) in terms of AP and AT, and use (5.5.22), we find 
the following relation 
aulk| sp, v8 


Ap + 26 
@P |, | oF 


Ss 
Ap + OH 


Ops 
AT —kTc, = —2 
é OT 


AT —kTeg . (5.5.2 
ae cg . (5.5.23) 


0 


We now recall that G= uN = E—TS+ PV, and using it we obtain 
dG = —SdT + VdP + udN = d(pN) = pdN + Ndp , 


CL ee NS ese aig 
OPjay No ST }py MN 


The derivatives in (5.5.23) can therefore be expressed in terms of the volumes 
per molecule vy, and vg, and the entropies per molecule sz and sg in the liquid 
and solid phases of the pure solvent, 


'® Since two phases and two components are present, the number of degrees of 
freedom is two (Gibbs’ phase rule). One can for example fix the temperature and 
one concentration; then the other concentration and the pressure are determined. 

16 The chemical potentials of the solute must of course also be equal. From this fact, 
we can for example express the concentration in the solid phase, cg, in terms of 
T and cy. We shall, however, not need the exact value of cg, since cys < cy is 
negligible. 
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—(sg a su )AT + (ug a up) AP = (cg _ cL )kT $ (5.5.24) 
Finally, we introduce the heat of melting g = T(s, — sg), thus obtaining 

£Ar + (vg — un) AP = (eg — cy)kT . (5.5.25) 
The change in the transition temperature AT at a given pressure is obtained 
from (5.5.25), by setting P = Py or AP = 0: 

kT? 

AT = are: — cL) : (5.5.26) 
As arule, the concentration in the solid phase is much lower than that in the 
liquid phase, i.e. cs < cy; then (5.5.26) simplifies to 


kT? 
AT = ———cy, <0. (5.5.26') 
qd 


Since the entropy of the liquid is larger, or on melting, heat is absorbed, it 
follows that q > 0. As a result, the dissolution of a substance gives rise to a 
freezing-point depression. 


Note: On solidification of a liquid, at first (5.5.26’) holds, with the initial concen- 
tration cy. Since however pure solvent precipitates out in solid form, the concen- 
tration cr increases, so that it requires further cooling to allow the freezing process 
to continue. Freezing of a solution thus occurs over a finite temperature interval. 


The above results can be transferred directly to the evaporation process. 
To do this, we make the replacements 


L-G,S—L 


and obtain from (5.5.25) for the liquid phase (L) and the gas phase (G) the 
relation 


SAT + (vu, — vg) AP = (er —ca)kT (5.5.27) 
Setting AP = 0 in (5.5.27), we find 


kT? kT? 
AT = (cL ca) x ch > 0, (5.5.28) 


a boiling-point elevation. In the last equation, cy, >> cq was assumed (this no 
longer holds near the critical point). 
Setting AT = 0 in (5.5.27), we find 


AP SS py ee 
UL — UG UG 


(5.5.29) 
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a vapor-pressure reduction. When the gas phase contains only the vapor of 
the pure solvent, (5.5.29) simplifies to 


APS aT (5.5.30) 
UG 


Inserting the ideal gas equation, Pug = kT, we have 


AP= cLP = ci(Po + AP). 


Rearrangement of the last equation yields the relative pressure change: 


AP cL 


cae ~~ 


Po 1lt+e 


cL (5.5.31) 


known as Raoult’s law. The relative vapor-pressure reduction increases lin- 
early with the concentration of the dissolved substance. The results derived 
here are in agreement with the qualitative considerations given at the begin- 
ning of this subsection. 


Problems for Chapter 5 


5.1 The rotational motion of a diatomic molecule is described by the angular 
variables Y and y and the canonically conjugate momenta py and p, with the 


2 
Hamilton function H = oe + sho pe. Calculate the classical partition function 
for the canonical ensemble. 
Result: Zrot = 2 (see footnote 4 to Eq. (5.1.15). 


5.2 Confirm the formulas (5.4.13) for the critical pressure, the critical volume, 
and the critical temperature of a van der Waals gas and the expansion (5.4.18) of 
P(T,V) around the critical point up to the third order in Av. 


5.3 The expansion of the van der Waals equation in the vicinity of the critical 
point: 

(a) Why is it permissible in the determination of the leading order to leave off the 
term AT(AV)? in comparison to (AV)?? 

(b) Calculate the correction O(AT) to the coexistence curve. 

(c) Calculate the correction O((T' — T.)”) to the internal energy. 


5.4 The equation of state for a van der Waals gas is given in terms of reduced 
variables in Eq. (5.4.16). 

Calculate the position of the inversion points (Chap. 3) in the p*,7* diagram. 
Where is the maximum of the curve? 


5.5 Calculate the jump in the specific heat c, for a van der Waals gas at a specific 
volume of v # Ue. 


5.6 Show in general and for the van der Waals equation that «, and cy exhibit the 
same behavior for T — Ty.. 
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5.7 Consider two metals 1 and 2 (with melting points T,, T2 and temperature 
independent heats of melting qi, gz), which form ideal mixtures in the liquid 
phase (i.e. as for small concentrations over the whole concentration range). In the 
solid phase these metals are not miscible. Calculate the eutectic point Tp (see also 
Sect. 3.9.2). 

Hint: Set up the equilibrium conditions between pure solid phase 1 or 2 and the 
liquid phase. From these, the concentrations are determined: 


Gg =e, where A; = eu jn ; ¢#=1,2 
kT; T 
using 
a(G/T) 


=—H/T’ = AH; , =pN. 
aT 3) GE G=p 


5.8 Apply the van’t Hoff formula (5.5.17’) to the following simple example: the 
concentration of the dissolved substance is taken to be c = 0.01, the solvent is 
water (at 20°C); use px.o = 1g/cm* (20°C). Find the osmotic pressure AP. 


6. Magnetism 


In this chapter, we will deal with the fundamental phenomenon of magnetism. 
We begin the first section by setting up the density matrix, starting from the 
Hamiltonian, and using it to derive the thermodynamic relations for magnetic 
systems. Then we continue with the treatment of diamagnetic and paramag- 
netic substances (Curie and Pauli paramagnetism). Finally, in Sect. 6.5.1, we 
investigate ferromagnetism. The basic properties of magnetic phase transi- 
tions will be studied in the molecular-field approximation (Curie-Weiss law, 
Ornstein-Zernike correlation function, etc.). The results obtained will form 
the starting point for the renormalization group theory of critical phenomena 
which is dealt with in the following chapter. 


6.1 The Density Matrix and Thermodynamics 


6.1.1 The Hamiltonian and the Canonical Density Matrix 


We first summarize some facts about magnetic properties as known from 
electrodynamics and quantum mechanics. The Hamiltonian for N electrons 
in a magnetic field H = curl A is: 


N 
1 e€ ce spin 
H= >) 5 (p: =o es (x:)) — wi? A (xi) + Woo. (6.1.1) 
i=1 


The index i enumerates the electrons. The canonical momentum of the ith 
electron is p; and the kinetic momentum is mv; = p; — £A (x;). The charge 
and the magnetic moment are given by 

= GeLB 


e=-e, primes = S;, (6.1.2a) 


where along with the elementary charge eg, the Bohr magneton 


h J 
ge = hoor: 10 = 0.927 - 10-8 (6.1.2b) 


2mc auss 
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as well as the Landé-g-factor or the spectroscopic splitting factor of the elec- 
tron 


Je = 2.0023 (6.1.2c) 


were introduced. The quantity y = $2 = —“* is called the magnetome- 
chanical ratio or gyromagnetic ratio. The last term in (6.1.1) stands for the 
Coulomb interaction of the electrons with each other and with the nuclei. 
The dipole-dipole interaction of the spins is neglected here. Its conse- 
quences, such as the demagnetizing field, will be considered in Sect. 6.6; see 
also remark (ii) at the end of Sect. 6.6.3. We assume that the magnetic field 
H is produced by some external sources. In vacuum, B = H holds. We use 
here the magnetic field H, corresponding to the more customary practice in 


the literature on magnetism. The current-density operator is thus given by? 


N 


j00) = eRe = {5 [(pi— £4 Ow) 6-0] 


i=1 


+ 


+c curl (ae 6 (x — x:)) \ 
(6.1.3) 
with [A,B], = AB+ BA. The current density contains a contribution from 


the electronic orbital motion and a spin contribution. 
For the total magnetic moment, one obtains**: 


p= x | tex xi = {5x x (pi — <A (x,)) + ppt . (6.14) 


i=1 
When H is uniform, Eq. (6.1.4) can also be written in the form 


OH 


The magnetic moment of the ith electron for a uniform magnetic field (see 
Remark (iv) in Sect. 6.1.3) is — according to Eq. (6.1.4) — given by 


j= eS A) 


: 2me a (6.1.6) 
2 
a ee (Li + geSi) Ime (H xi — xX; (X; -H)) 


? The intermediate steps which lead to (6.1.3)-(6.1.5) will be given at the end of 
this section. 

3 J.D. Jackson, Classical Electrodynamics, 2nd edition, John Wiley and sons, New 
York, 1975, p. 18. 

4 Magnetic moments are denoted throughout by p, except for the spin magnetic 
moments of elementary particles which are termed p*?™”. 


6.1 The Density Matrix and Thermodynamics 271 
If H = He,, then (for a single particle) it follows that 


e€ es er ae OH 
(Li + geSi), i) Amc2 (a7 + y7) = OH ’ 


Biz = one 


and the Hamiltonian is” 


“fp? e Ce sie ae 
= i L; + 2S;), H 24 ait COL: 
H i ( S;),H + ane (2 +2) b+We 1. (6.1.7) 


Here, we have used go. = 2. 

We now wish to set up the density matrices for magnetic systems; we 
can follow the steps in Chap. 2 to do this. An isolated magnetic system is 
described by a microcanonical ensemble, 


pmo =6(H—E)/Q(E,H) with @(£,H) =Tr 6(H-—B), 


where, for the Hamiltonian, (6.1.1) is to be inserted. If the magnetic system 
is in contact with a heat bath, with which it can exchange energy, then one 
finds for the magnetic subsystem, just as in Chap. 2, the canonical density 
matrix 


1 
p= mons (6.1.8) 
The normalization factor is given by the partition function 
ZT etl. (6.1.9a) 


The canonical parameters (natural variables) are here the temperature, whose 
reciprocal is defined as in Chap. 2 in the microcanonical ensemble as the 
derivative of the entropy of the heat bath with respect to its energy, and the 
external magnetic field H.® Correspondingly, the canonical free energy, 


F(T, H) = —kT log Z , (6.1.9b) 


is a function of T and H. The entropy S and the internal energy E are, by 
definition, calculated from 


1 


5 =—k (log p) = = (E- F), (6.1.10) 


> See e.g. QM I, Sect. 7.2. 

° In this chapter we limit our considerations to magnetic effects. Therefore, the 
particle number and the volume are treated as fixed. For phenomena such as 
magnetostriction, it is necessary to consider also the dependence of the free 
energy on the volume and more generally on the deformation tensor of the solid 
(see also the remark in 6.1.2.4). 
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and 
E=(K). (6.1.11) 


The magnetic moment of the entire body is defined as the thermal average of 
the total quantum-mechanical magnetic moment 


M = ub =-(Se) (6.1.12) 


The magnetization M is defined as the magnetic moment per unit volume, 
i.e. for a uniformly magnetized body 


M= aM (6.1.13a) 
and, in general, 
M = [@emtes). (6.1.13b) 


For the differential of F’, we find from (6.1.9)—(6.1.10) 
dT 


dF = (F—E)—--M dH =—Sdl—-M_-dH, (6.1.14a) 
that is 
OF OF 
ae d ace = —M. 1.14 
ics ale Bi ae aay 


Using equation (6.1.10), one can express the internal energy F in terms of F’ 
and S and obtain from (6.1.14a) the First Law for magnetic systems: 


dE =TdS —MdH . (6.1.15) 


The internal energy F contains the interaction of the magnetic moments 
with the magnetic field (see (6.1.7)). Compared to a gas, we have to make 
the following formal replacements in the First Law: V — H, P — M. Along 
with the (canonical) free energy F'(T,H), we introduce also the Helmholtz 
free energy" 


A(T,M) = F(T,H)+M.-H. (6.1.16) 
Its differential is 
dA = —SdT + HdM , (6.1.17a) 
i.e. 
OA OA 
pata ie ere —H. AL 
Ore ae ey, oa) 


” The notation of the magnetic potentials is not uniform in the literature. This is 
true not only of the choice of symbols; even the potential F(T, H), which depends 
on H, is sometimes referred to as the Helmholtz free energy. 
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6.1.2 Thermodynamic Relations 
*6.1.2.1 Thermodynamic Potentials 


At this point, we summarize the definitions of the two potentials introduced 
in the preceding subsection. The following compilation, which indicates the 
systematic structure of the material, can be skipped over in a first reading: 


F=F(T,H)=E-TS, dF =—SdT —MdH_ (6.1.18a) 
A=A(T,M)=E-TS4+M-H, dA=-—SdT+HdM. (6.1.18b) 


In comparison to liquids, the thermodynamic variables here are T,H and 
M instead of T, P and V. The thermodynamic relations listed can be read 
off from the corresponding relations for liquids by making the substitutions 
V — —M and P — H. There is also another analogy between magnetic 
systems and liquids: the density matrix of the grand potential contains the 
term —N, which in a magnetic system corresponds to -H-M. Particularly in 
the low-temperature region, where the properties of a magnetic system can be 
described in terms of spin waves (magnons), this analogy is useful. There, the 
value of the magnetization is determined by the number of thermally-excited 
spin waves. Therefore, we find the correspondence M — N and H © wp. Of 
course the Maxwell relations follow from (6.1.15) and (6.1.18a,b) 


(aa) ,= Gar (sa). - (GE) (6.1.19) 


*6.1.2.2 Magnetic Response Functions, Specific Heats, 
and Susceptibilities 


Analogously to the specific heats of liquids, we define here the specific heats 
Cy and Cy (at constant M and H) as® 


__ (as PA 
cust (5) = to) (6.1.20a) 
Os OE O°F 
=T( = =-—T ; .1.20b 
ve (sr), Gar (an) niece) 


Instead of the compressibilities as for liquids, in the magnetic case one has 
the isothermal susceptibility 


OM 1 (OF 
eee en oa 


8 To keep the notation simple, we will often write H and M as H and M, making 
the assumption that M is parallel to H and that H and M are the components 
in the direction of H. 
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and the adiabatic susceptibility 


aM 1 (@E 
x= ( ) = ( ) (6.1.21b) 
dH), V\@H?), 


In analogy to Chap. 3, one finds that 


Cry — Cu = TV ay,/Xr ; (6.1.22a) 
xr —xXs = TVo%,/Cy (6.1.22b) 
and 
Cu _ xr (6.1.22c) 
Cu xs 


Here, we have defined 


ay = Cae (6.1.23) 


Eq. (6.1.22a) can also be rewritten as 


Cry —— Cu = TVajy XT ; (6.1.22d) 
where 
OH QH 
eof [ee es, 6.1.22 
oe ( oT ) M XT ( °) 
was used. 


*6.1.2.3 Stability Criteria and the Convexity of the Free Energy 


One can also derive inequalities of the type (3.3.5) and (3.3.6) for the mag- 
netic susceptibilities and the specific heats: 


xr = 0, Cy > 0 and Cu >0. (6.1.24a,b,c) 


To derive these inequalities on a statistical-mechanical basis, we assume that 
the Hamiltonian has the form 


H= Hy —p-H, (6.1.25) 


where H thus enters only linearly and 4 commutes with H. It then follows 
that 


—s : (Se). = > (aH 0) = E(u (u))?) = 0 (6.1.26a) 
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0 O Tr e PCF 1 2 
= (sa ») & Tre~ x i" pra \t— GO) ) 2 
(6.1.26b) 

with which we have demonstrated (6.1.24a) and (6.1.24b). Eq. (6.1.24c) can 
be shown by taking the second derivative of 

A(T,M) = F(T, H)+ HM 
with respect to the temperature at constant M (problem 6.1). As a result, 
F(T, H) is concave? in T and in H, while A(T, M) is concave in T and convex 
in Mi (St) , =~ SO, (Sa), = (Bitln = Ver 20): 

In this derivation, we have used the fact that the Hamiltonian H( has the 


general form (6.1.25), and therefore, diamagnetic effects (proportional to H”) 
are negligible. 


Remark: In analogy to the extremal properties treated in Sect. 3.6.4, the canonical 
free energy F' for fixed T and H in magnetic systems strives towards a minimal 
value, as does the Helmholtz free energy A for fixed T and M. At these minima, 
the stationarity conditions 6F' = 0 and 6A = 0 hold, i-e:: 

dF <0 when T and H are fixed, and dA < 0 when T and M are fixed. 


6.1.2.4 Internal Energy 


E = (4H) is the internal energy, which is found in a natural manner from 
statistical mechanics. It contains the energy of the material including the 
effects of the electromagnetic field, but not the field energy itself. It is usual 
to introduce a second internal energy, also, which we denote by U and which 
is defined as 


U=E+M.-H; (6.1.27a) 
it thus has the complete differential 
dU =TdS+HdM. (6.1.27b) 
From this, we derive 
OU OU 
T=(— H = | —— 6.1.27 
(5), ° ¥= (ane), yo 
and the Maxwell relation 
OH OT 
— =|]. 6.1.28 
(33),.> (axa), ae 


° See also R.B. Griffiths, J. Math. Phys. 5, 1215 (1964). In fact, it is 
sufficient for the proof of (6.1.24a) to show that pw enters H linearly. 
Cf. M.E. Fisher, Rep. Progr. Phys. XXX, 615 (1967), p. 644. 


276 6. Magnetism 


Remarks: 


(i) As was emphasized in footnote 5, throughout this chapter the particle 
number and the volume are treated as fixed. In the case of variable volume 
and variable particle number, the generalization of the First Law takes on 
the form 


dU = TdS — PdV + pdN + HdM (6.1.29) 
and, correspondingly, 


dE = TdS — PdV + dN —MdH. (6.1.30) 


The grand potential 
&(T, V, w,H) = —kT log Tre~ 8-4) (6.1.31a) 


then has the differential 


d® = —SdT — PdV — pdN — MdH , (6.1.31b) 


where the chemical potential 4 is not to be confused with the microscopic 
magnetic moment jz. 

(ii) We note that the free energies of the crystalline solid are not rotation- 
ally invariant, but instead are invariant only with respect to rotations of 
the corresponding point group. Therefore, the susceptibility xi; = om is 
a second-rank tensor. In this textbook, we present the essential statistical 
methods, but we forgo a discussion of the details of solid-state physics or ele- 
ment specific aspects. The methods presented here should permit the reader 
to master the complications which arise in treating real, individual problems. 


6.1.3 Supplementary Remarks 


(i) The Bohr-van Leeuwen Theorem. 
The content of the Bohr—van Leeuwen theorem is the nonexistence of mag- 
netism in classical statistics. 
The classical partition function for a charged particle in the electromagnetic 
field is given by 


N N 
_ fap Sane _s¢({pi—e Aaa) }.{x0}) /AT 
ana (6.1.32) 
TT : 


cl 


Making the substitution pi = p; — £A(x;), we can see that Z,; becomes 


independent of A and thus also of H. Then we have M = ot = 0, and 
x= -+ a = 0. Since the spin is also a quantum-mechanical phenomenon, 
dia-, para-, and ferromagnetism are likewise quantum phenomena. One might 
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ask how this statement can be reconciled with the ‘classical’ Langevin para- 
magnetism which will be discussed below. In the latter, a large but fixed 
value of the angular momentum is assumed, so that a non-classical feature 
is introduced into the theory. In classical physics, angular momenta, atomic 


radii, etc. vary continuously and without limits.!° 
(ii) Here, we append the simple intermediate computations leading to (6.1.3)— 


6.1.5). In (6.1.3), we need to evaluate —c-*.. The first term in (6.1.1) evidently 
5A (2) 


leads to the first term in (6.1.3). In the component of the current ja, taking the 
derivative of the second term leads to 


spin spin 6) 
wah pe”. curl A(x;) = mab Mig € 376 aa As (xi) = 
iy 


N 
= =e Spies me 6 (x — xi) das =C a rot [wir (x - ~)]) : 


i=l 


Pairs of Greek indices imply a summation. Since the derivative of the third term in 
(6.1.1) yields zero, we have demonstrated (6.1.3). 

(iii) In (6.1.4), the first term is obtained in a readily-apparent manner from the 
first term in (6.1.3). For the second term, we carry out an integration by parts and 
use O5%g = 55g, obtaining 


N 
> (/ d’xx x curl [uP 5 &-x)]) = 
i=1 a 


N 

1 spin 

5 ) [ee Eapy UB ey5p O5 [ue 6 (x— x:)] = 
i=1 


N 
1 in 
~3 S fe T €aBy Ey5p Osabip d(x—xi) = 
i=l 


N 


N 
ay; / dx (—28ap) poP6 (x — xi) =o yen, 
i=l 


with which we have demonstrated (6.1.4). 

(iv) Finally, we show the validity of (6.1.5). 

We can write the vector potential of a uniform magnetic field in the form 

A = 3H x x, since curlA = }(H(V-x)-—(H-V)x) yields H. To obtain the 
deroutae, we use te5a7 Xir for eis derivative with respect to Ha after the second 


2 
equals are below, finding 


OH 2 e e Oo ol 
—_ 4 A) ( ) Opt HH iT spin = 
OH. 24 am (p Cte c/ OH. pee Wirt Bio 


e al e€ 
= ce (~~ (e-£a)), rt 
Cc 
i=1 


which is in fact the right-hand side of (6.1.4). 


(6.1.33) 


10 A detailed discussion of this theorem and the original literature citations are to 
be found in J.H. van Vleck, The Theory of Electric and Magnetic Susceptibility, 
Oxford, University Press, 1932. 
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In the Hamiltonian (6.1.1), Weou, contains the mutual Coulomb interaction 
of the electrons and their interactions with the nuclei. The thermodynamic 
relations derived in Sect. (6.1.2) are thus generally valid; in particular, they 
apply to ferromagnets, since there the decisive exchange interaction is merely 
a consequence of the Coulomb interactions together with Fermi—Dirac statis- 
tics. 

In addition to the interactions included in (6.1.1), there are also the mag- 
netic dipole interaction between magnetic moments and the spin-orbit inter- 
action,!! which lead among other things to anisotropy effects. The derived 
thermodynamic relations also hold for these more general cases, whereby 
the susceptibilities and specific heats become shape-dependent owing to the 
long-range dipole interactions. In Sect. 6.6, we will take up the effects of 
the dipole interactions in more details. For elliptical samples, the internal 
magnetic field is uniform, H; = H — DM, where D is the demagnetizing 
tensor (or simply the appropriate demagnetizing factor, if the field is applied 
along one of the principal axes). We will see that instead of the susceptibil- 
ity with respect to the external field H, one can employ the susceptibility 
with respect to the macroscopic internal field, and that this susceptibility 
is shape-independent.!? In the following four sections, which deal with basic 
statistical-mechanical aspects, we leave the dipole interactions out of con- 
sideration; this is indeed quantitatively justified in many situations. In the 
next two sections 6.2 and 6.3, we deal with the magnetic properties of non- 
interacting atoms and ions; these can be situated within solids. The angular 
momentum quantum numbers of individual atoms in their ground states are 
determined by Hund’s rules.'8 


6.2 The Diamagnetism of Atoms 


We consider atoms or ions with closed electronic shells, such as for exam- 
ple helium and the other noble gases or the alkali halides. In this case, 
the quantum numbers of the orbital angular momentum and the total spin 
in the ground state are zero, S = 0 and L = 0, and as a result the to- 
tal angular momentum J = L +S is also J = 0.4 Therefore, we have 


1 The spin-orbit interaction « LS leads in effective spin models to anisotropic 
interactions. The orbital angular momentum is influenced by the crystal field of 
the lattice, transferring the anisotropy of the lattice to the spin. 

” For non-elliptical samples, the magnetization is not uniform. In this case, ou 
depends on position within the sample and has only a local significance. It is 


then expedient to introduce a total susceptibility XP's = (SY), Pe which differs 


from (6.1.33) in the homogeneous case only by a factor of V. 

13 See e.g. QM I, Chap. 13 and Table 1.12 

4 The diamagnetic contribution is also present in other atoms, but in the mag- 
netic fields which are available in the laboratory, it is negligible compared to the 
paramagnetic contribution. 
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L|0) = S|0) = J |0) = 0, where |0) designates the ground state. The param- 
agnetic contribution to the Hamiltonian (6.1.7) thus vanishes in every order 
of perturbation theory. It suffices to treat the remaining diamagnetic term 
in (6.1.7) in first-order perturbation theory, since all the excited states lie at 
much higher energies. Owing to the rotational symmetry of the wavefunc- 
tions of closed shells, we find (0| >>; (a7 + y?) 0) = $(0| 0,7? |0) and, for 
the energy shift of the ground state, 


pee (0) S>r? [0) . (6.2.1) 
12mc? : : 


From this it follows for the magnetic moment and the susceptibility of a single 
atom: 


OE, e? (0| >, r? |0) O(pz) e* (0| 0,77 10) 


z= — A, = — 
(Hz) OH 6mc? x OH 6mc? 


(6.2.2) 


where the sums run over all the electrons in the atom. The magnetic moment 
is directed oppositely to the applied field and the susceptibility is negative. 
We can estimate the magnitude of this so called Langevin diamagnetism using 
the Bohr radius: 

D5 CAU OR 10 te oe 


= aS -30,.,,3 
X= ——B49-27 x 10a cm” = —5 x 10°°"cm”, 


3 3 


y per mole = —5 x 10729 x 6 x 103 = ww 3 x 19-8 
mole mole 


The experimental values of the molar susceptibility of the noble gases are 
collected in Table 6.1. 


Table 6.1. Molar susceptibilities of the noble gases 


He Ne Ar Kr Xe 


xin 10~&cm3/mole  -1.9 -7.2 -15.4 -28.0  -43.0 


An intuitively apparent interpretation of this diamagnetic susceptibility runs 
as follows: the field H induces an additional current Aj = —erAw, whereby the 
orbital frequency of the electronic motion increases by the Larmor frequency Aw = 
oe The sign of this change corresponds to Lenz’s law, so that both the magnetic 
moment jz and the induced magnetic field are opposite to the applied field H: 


rAj r? Awe er? 
2c 2c 4mc? ° 


Me ~ 


We also note that the result (6.2.2) is proportional to the square of the Bohr radius 
and therefore to the fourth power of h, confirming the quantum nature of magnetic 
phenomena. 
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6.3 The Paramagnetism of Non-coupled Magnetic 
Moments 


Atoms and ions with an odd number of electrons, e.g. Na, as well as atoms 
and ions with partially filled inner shells, e.g. Mn?+, Gd", or U4* (transition 
elements, ions which are isoelectronic with transition elements, rare-earth and 
actinide elements) have nonvanishing magnetic moments even when H = 0, 


e€ 


w= 5 (L+g.8) = oa (J+8) (ge=2). (6.3.1) 


Here, J = L +S is the total angular momentum operator. For relatively low 
external magnetic fields (i.e. ehH/mc < spin-orbit coupling)) with H applied 
along the z-axis, the theory of the Zeeman effect’> gives the energy-level shifts 


AEm, = gupM A ; (6.3.2) 
where M, runs over the values My = —J,...,J1® and the Landé factor 
J(J+1)+ S(9 + 1) — L(L 41) 
=1 6.3.3 
ee I(T +1) (os) 
was used. Familiar special cases are D = 0 : g = 2,M; = Ms = +3 and 


S=0: g=1,M;=M,=-L,...,L. 
The Landé factor can be made plausible in the classical picture where L and S 


precess independently around the spatially fixed direction of the constant of the 
motion J. Then we find: 


(L+28), =~ “al Be u ge 7s 
2%, (+ es | 
The partition function then becomes 
: 7 sinh n (2J +1) /2\% 
a SS a) 7 (= ae 2 ) (6.3.4) 
with the abbreviation 
: oT . (6.3.5) 


Here, we have used the fact that 


' Cf. e.g. QM I, Sect. 14.2 
 J=L+8, Jz |m;) =hm,;|m,;) 
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J 2d QI+1 - 
S- enn™ — e- Sve” =e eal = sinh ara: 1) /2 : 
e7 — 1 sinh 7/2 
m=—J r=0 
For the free energy, we find from (6.3.4) 
sinh 7 (2J + 1) /2 
F(T, H)=-kTN\1 3. 
(7, H) = -KTW tog { NEI? Y (6.3.6) 
from which we obtain the magnetization 
1 OF 
M=- SS = JB 6.3.7 
Van @ Toes Bs (n) (6.3.7) 


(n= x). The magnetization is oriented parallel to the magnetic field H. In 


Eq. (6.3.7) we have introduced the Brillouin function By, which is defined 
as 


By(n) = “ {(u+ 5) coth n(J + 5) a 5 coth 2 (6.3.8) 
(Fig. 6.1). We now consider the asymptotic limiting cases: 
n—0: cothn = —+ 2+ 0 (n8), By(n) = Fn +0 (1) 
(6.3.9a) 
and 
Noo: By(ow)=1 (6.3.9b) 


Brillouin function B (x / J) 


“oe 1 2 3 4 5 6 


Fig. 6.1. The Brillouin function for J = 1/2,1,3/2,2,00 as a function of x = 
ae = J. For classical momentsB. is identical to the Langevin function 
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Inserting (6.3.9a) into (6.3.7), we obtain for low applied fields (H « kT/Jgpp) 


= 2J(J+1)H 
M =n (gps) 3hr (6.3.10a) 
while from (6.3.9b), for high fields (H >> kT’/Jgpp,), we find 
M=ngppJt (6.3.10b) 


This signifies complete alignment (saturation) of the magnetic moments. An 
important special case is represented by spin-$ systems. Setting J = $ in 
(6.3.8), we find 


2 127 n 
n (cosh 3 + sinh i) cosh 5 n 
B = 2coth h== =tanh-—. 
3 (”) ee 2 sinh 4 cosh 4 sinh 4 a 2 
(6.3.11) 


This result can be more directly obtained by using the fact that for spin 
S = 1/2, the partition function of a spin is given by Z = 2coshn/2 and 
the average value of the magnetization by M = ngypZ~! sinhn/2. Letting 
J = ©, while at the same time gup — 0, so that uw = gupJ remains finite, 
we find 


1 
Bx (n) = cothnd — wo = coth —— (6.3.12a) 


J kT pH 
Bo (n) is called the Langevin function for classical magnetic moments pu; 
together with (6.3.7), it determines the magnetization 


x a ee 
M=np (com iT =) (6.3.12b) 


of “classical” magnetic moments of magnitude yw. A classical magnetic mo- 
ment pcan be oriented in any direction in space; its energy is EF = —yH cosv, 
where ¥ is the angle between the field H and the magnetic moment py. The 
classical partition function for one particle is Z = [ dQ e~®/kT and leads via 
(6.1.9b) once again to (6.3.12b). Finally, for the susceptibility we obtain 


x =n (915)? Bin) (6.3.13) 


In small magnetic fields H « Sane this gives the Curie law 


oy SIA) 


ae (6.3.14) 


XCurie = 1 (gu) 


The magnetic behavior of non-coupled moments characterized by (6.3.7), 
(6.3.13), and (6.3.14) is termed paramagnetism. The Curie law is typical of 
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0 2 4 §kT/sH 0 2 4 SkT/usH 


0.2 


Fig. 6.2. The entropy, the in- 
ternal energy, and the specific 
2 2 4kT/ HpH heat of a spin-4 paramagnet 


preexisting elementary magnetic moments which need only be oriented by the 
applied field, in contrast to the polarization of harmonic oscillators, whose 
moments are induced by the field (cf. problem 6.4). 

We include a remark about the magnitudes. The diamagnetic susceptibil- 
ity per mole, from the estimate which follows Eq. (6.2.2), is equal to about 
ymele x —10~-5cm?/mole. The paramagnetic susceptibility at room temper- 
ature is roughly 500 times larger, ie. y° = 1072-1073cm/mole. The 
entropy of a paramagnet is 


OF sinh 22/4) 
oie -(FF). ae ( oa —JBy(n) | (6.3.15) 
For spin 5, (6.3.15) simplifies to 
HpH\ — pH ppl 
= Nk {1 2 h tanh 31 
S (io ( cosh TF ) yp tanh Te ) (6.3.16) 


with the limiting case 
S=WNklog2 for H—-0O. (6.3.16’) 


The entropy, the internal energy, and the specific heat of the paramagnet 
are reproduced in Figs. 6.2a,b,c. The bump in the specific heat is typical of 
2-level systems and is called a Schottky anomaly in connection with defects. 
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Van Vleck paramagnetism: The quantum number of the total angular momen- 
tum also becomes zero, J = 0, when a shell has just one electron less than half full. 
In this case, according to Eq. (6.3.2) we have indeed (0| J +S |0) = 0, but the para- 
magnetic term in (6.1.7) yields a nonzero contribution in second order perturbation 
theory. Together with the diamagnetic term, one obtains for the energy shift of the 
ground state 


|(0| (L+2S8)-H|n)/? 
an eT —s 7 (OS (c? +y?)|0). (6.3.17) 


The first, paramagnetic term, named for van Vieck!”, which also plays a role in the 
magnetism of molecules!®, competes with the diamagnetic term. 


6.4 Pauli Spin Paramagnetism 


We consider now a free, three-dimensional electron gas in a magnetic field 
and restrict ourselves initially to the coupling of the magnetic field to the 
electron spins. The energy eigenvalues are then given by Eq. (6.1.7): 

Bee ptl 


SS it aie 6.4.1 
ept = 5 + 5 gelB (6.4.1) 


The energy levels are split by the magnetic field. Electrons whose spins are 
aligned parallel to the field have higher energies, and these states are therefore 
less occupied (see Fig. 6.3). 


spin magn. moment 


f ; 
' t 


(a) (b) 


Fig. 6.3. Orientation (a) of the spins, and (b) of the magnetic moments. (c) The 
energy as a function of p (on the left for positive spins and on the right for negative 
spins) 


7 JH. van Vleck, The Theory of Magnetic and Electric Susceptiblities, Oxford 
University Press, 1932. 

18 Ch. Kittel, Introduction to Solid State Physics, Third edition, John Wiley, New 
York, 1967 
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The number of electrons in the two states is found to be 


Co 


4 | ee | 1 1 
Ni = d? So — gyiph. |= yide= + -goppH ) , 
+= Gaps / pn( 5 Jel ) if € sHon(« 5 9eHB ) 
0 
(6.4.2) 
where the density of states has been introduced: 
gV / F sere 
v(e) = d°pd(e—€,) = N=; ; 6.4.3 
( ) (27h)? Pp ( p) 2 A? ( ) 
it fulfills the normalization condition fj" dev(e) = N. In the case that 
golpH «K pw & er, we can expand in terms of H: 
7 d / 1 2 
Ni = | de 14) n(e) +n’ (e) x 9ehBH + O(H’)| . (6.4.4) 
0 
For the magnetization, using the above result we obtain: 
= = owed / 3 
M = —pp(Ny—N_)/V = rage dev(e)n'(e)+O(H*) , — (6.4.5) 
0 
where we have set g. = 2. For T — 0, we find from (6.4.5) the magnetization 
3 NH 
M = piv(er)H/V + O (H?) = Se tee + O (H®) (6.4.6) 
F 
and the magnetic susceptibility 
32 N 2 
= =p : A, 
XP sib, +O (H ) (6.4.7) 


This result describes the phenomenon of Pauli spin paramagnetism. 
Supplementary remarks: 
(i) For T # 0, we must take the change of the chemical potential into 


account, making use of the Sommerfeld expansion: 


N= f dev(e)n(c) +0 (H?) = f acto pe un v' (tt) 4 OH? E-) 
0 0 


= f dev(e) + (ner) v(er) + PANTY Ver) +0 (2,74 . (6.4.8) 
0 


Since the first term on the right-hand side is equal to N, we write 


_ m(kT)* v' (er) é 
ep = a ven + O(H?,T*) . (6.4.9) 
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Integrating by parts, we obtain from (6.4.5) and (6.4.9) 


M= = i v'(e)n(e) + O (H®) 
0 
= sete v(m) 4 i PY a) + O (H?,T*) (6.4.10) 
= Hee se T en) ee "(er)) +0 (H3,T*) . 


(ii) The Pauli susceptibility (6.4.7) can be interpreted similarly to the linear 
specific heat of a Fermi gas (see Sect. 4.3.2): 


Ve 
KP = xounie ar _ Lp v(er)/V . (6.4.11) 


Naively, one might expect that the susceptibility of N electrons would be 
equal to the Curie susceptibility ycurie from Eq. (6.3.14) and therefore would 
diverge as 1/T. It was Pauli’s accomplishment to realize that not all of the 
electrons contribute, but instead only those near the Fermi energy. The num- 
ber of thermally excitable electrons is kTv(ep). 

(iii) The Landau quasiparticle interaction (see Sect. 4.3.3e, Eq. 4.3.29c) 
yields 


LB (er) 


VOLE’ (6.4.12) 


XP = 
In this expression, F, is an antisymmetric combination of the interaction 
parameters.1? 
(iv) In addition to Pauli spin paramagnetism, the electronic orbital motions 
give rise to Landau diamagnetism 7° 


e? kp 
= -—— > 6.4.13 
oe 120?mc? ( ) 
For a free electron gas, vy, = — 3X p. The lattice effects in a crystal have differ- 


ing consequences for yy, and yp. Eq. (6.4.13) holds for free electrons neglect- 
ing the Zeeman term. The magnetic susceptibility for free spin-4 fermions is 
composed of three parts: it is the sum 


X=xP+XL+XOsc - 


19 D. Pines and Ph. Noziéres, The Theory of Quantum Liquids Vol. I: Normal Fermi 
Liquids, W. A. Benjamin, New York 1966, p. 25 


20 See e.g. D. Wagner, Introduction to the Theory of Magnetism, Pergamon Press, 
Oxford, 1972. 
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XOsc 1s an oscillatory part, which becomes important at high magnetic fields 
HT and is responsible for de Haas—van Alphen oscillations. 

(v) Fig. 6.3c can also be read differently from the description given above. 
If one introduces the densities of states for spin th/2 


Oe / dp 5(€— eps) 


(2nh)” 
= ae [ #s(e- Ce at dott #1) 
= —s J wve(« - 5 dott) a(0 - [2m om seus) ) 
0 
= Nap e(c = 5 dott) (« - 5 deltn H) i , 


then the solid curves which are drawn on the left and the right also refer to 
v,(e) and v_(e). 
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6.5.1 The Exchange Interaction 


Ferromagnetism and antiferromagnetism are based on variations of the ex- 
change interaction, which is a consequence of the Pauli principle and the 
Coulomb interaction (cf. the remark following Eq. (6.1.33)). In the simplest 
case of the exchange interaction of two electrons, two atoms or two molecules 
with the spins 8; and Sg, the interaction has the form +J S,-Se2, where J 
is a positive constant which depends on the distance between the spins. The 
exchange constant +J is determined by the overlap integrals, containing the 
Coulomb interaction.24 When the exchange energy is negative, 


B= —JS, : So ; (6.5.1a) 


then a parallel spin orientation is favored. This leads in a solid to ferro- 
magnetism (Fig. 6.4b); then below the Curie temperature T., a spontaneous 
magnetization occurs within the solid. When the exchange energy is positive, 


E=JS8,-S2, (6.5.1b) 


then an antiparallel spin orientation is preferred. In a suitable lattice struc- 
ture, this can lead to an antiferromagnetic state: below the Néel temperature 
Tn, an alternating (staggered) magnetic order occurs (Fig. 6.4c). Above the 


21 See Chaps. 13 and 15, QM I 
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Fig. 6.4. A crystal lattice of magnetic ions. The spin S; is located at the posi- 
tion x;, and | denumerates the lattice sites. (a) the paramagnetic state; (b) the 
ferromagnetic state; (c) the antiferromagnetic state. 
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respective transition temperature (Tc or Ty), a paramagnetic state occurs 
(Fig. 6.4a). The exchange interaction is, to be sure, short-ranged; but owing 
to its electrostatic origin it is in general considerably stronger than the dipole- 
dipole interaction. Examples of ferromagnetic materials are Fe, Ni, EuO; and 
typical antiferromagnetic materials are MnF2 and RbMnFs. 

In the rest of this section, we turn to the situation described by equation 
(6.5.1a), i.e. to ferromagnetism, and return to (6.5.1b) only in the discussion 
of phase transitions. We now imagine that the magnetic ions are located on 
a simple cubic lattice with lattice constant a, and that a negative exchange 
interaction (J > 0) acts between them (Fig. 6.4a). The lattice sites are enu- 
merated by the index /. The position of the [th ion is denoted by x; and its 
spin is S;. All the pairwise interaction energies of the form (6.5.1a) contribute 
to the total Hamiltonian ??: 


1 
H=—-5 So Jw Si: Su (6.5.2) 


Lv 


Here, we have denoted the exchange interaction between the spins at the 
lattice sites | and lI’ by Jy. The sum runs over all / and 1’, whereby the 
factor 1/2 guarantees that each pair of spins is counted only once in (6.5.2). 
The exchange interaction obeys Jy = Jj, and we set Jj; = 0 so that we 
do not need to exclude the occurrence of the same /-values in the sum. The 
Hamiltonian (6.5.2) represents the Heisenberg model 7°. Since only scalar 
products of spin vectors occur, it has the following important property: H 


22 In fact, there are also interactions within a solid between more than just two 
spins, which we however neglect here. 

23 The direct exchange described above occurs only when the moments are near 
enough so that their wavefunctions overlap. More frequently, one finds an in- 
direct exchange, which couples more distant moments. The latter acts via an 
intermediate link, which can be a quasi-free electron in a metal or a bound elec- 
tron in an insulator. The resulting interaction is called in the first case the RKKY 
(Rudermann, Kittel, Kasuya, Yosida) interaction and in the second, it is referred 
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is invariant with respect to a common rotation of all the spin vectors. No 
direction is especially distinguished and therefore the ferromagnetic order 
which can occur may point in any arbitrary direction. Which direction is 
in fact chosen by the system is determined by small anisotropy energies or 
by an external magnetic field. In many substances, this rotational invariance 
is nearly ideally realized, e.g. in EuO, EuS, Fe and in the antiferromagnet 
RbMnFs3. In other cases, the anisotropy of the crystal structure may have 
the effect that the magnetic moments orient in only two directions, e.g. along 
the positive and negative z-axis, instead of in an arbitrary spatial direction. 
This situation can be described by the Ising model 


1 z z 
H=-5 » Tw S? Se. (6.5.3) 


This model is considerably simpler than the Heisenberg model (6.5.2), since 
the Hamiltonian is diagonal in the spin eigenstates of S7. But even for (6.5.3), 
the evaluation of the partition function is in general not trivial. As we shall 
see, the one-dimensional Ising model can be solved exactly in an elementary 
way for an interaction restricted to the nearest neighbors. The solution of the 
two-dimensional model, i.e. the calculation of the partition function, requires 
special algebraic or graph-theoretical methods, and in three dimensions the 
model has yet to be solved exactly. When the lattice contains N sites, then 
the partition function Z = Tr e~%** has contributions from all together 2% 
configurations (every spin can take on the two values t+h/2 independently of 
all the others). A naive summation over all these configurations is possible 
even for the Ising model only in one dimension. In order to understand the 
essential physical effects which accompany ferromagnetism, in the next sec- 
tion we will apply the molecular field approximation. It can be carried out 
for all problems related to ordering. We will demonstrate it using the Ising 
model as an example. 


6.5.2 The Molecular Field Approximation for the Ising Model 


We consider the Hamiltonian of the Ising model in an external magnetic field 


1 
Has ye J(L-V) ovo —hS oon. (6.5.4) 


Lv 1 


to as superexchange (see e.g. C. M. Hurd, Contemp. Phys. 23, 469 (1982)). Also 
in cases where direct exchange is not predominant and even for itinerant magnets 
(with 3d and 4s electrons which are not localized, but instead form bands), the 
magnetic phenomena, in particular their behavior near the phase transition, can 
be described using an effective Heisenberg model. A derivation of the Heisenberg 
model from the Hubbard model can be found in D.C. Mattis, The Theory of 
Magnetism, Harper and Row, New York, 1965. 
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In comparison to (6.5.3), equation (6.5.4) contains some changes of notation. 
Instead of the spin operators Sj’, we have introduced the Pauli spin matrices 
o; and use the eigenstates of the o/ as basis functions; their eigenvalues are 


o, =+1 forevery!. 


The Hamiltonian becomes simply a function of (commuting) numbers. By 
writing the exchange interaction in the form J(I-—I’) (J(-U) = J(U-D = 
Jyh? /4, J(0) = 0), we express the fact that the system is translationally 
invariant, i.e. J(J—1’) depends only on the distance between the lattice sites. 
The effect of an applied magnetic field is represented by the term —h 7, 07. 
The factor —t9up has been combined with the magnetic field H into h = 
a s guBH; the sign convention for h is chosen so that the a are aligned parallel 
to it. 

Due to the translational invariance of the Hamiltonian, it proves to be 
expedient for later use to introduce the Fourier transform of the exchange 
coupling, 


k= Oe: (6.5.5) 
I 


Frequently, we will require J (k) for small wavenumbers k. Due to the finite 
range of J(i — 1’), we can expand the exponential functions in (6.5.5) 


J(k) = So J() - Ss (k+x))"J(l) +... . (6.5.5’) 
I I 


For cubic and square lattices, and in general when reflection symmetry is 
present, the linear term in k makes no contribution. 

We can interpret the Hamiltonian (6.5.4) in the following manner: for 
some configuration of all the spins oj, a local field 


hp=h+ > J(L=U) ov (6.5.6) 
7 

acts on an arbitrarily chosen spin o7. If hj; were a fixed applied field, we could 
immediately write down the partition function for the spin o;. Here, however, 
the field hy depends on the configuration of the spins and the value of a; itself 
enters into the local fields which act upon its neighbors. In order to avoid this 
difficulty by means of an approximation, we replace the local field (6.5.6) by 
its average value, i.e. by the mean field 


(ly) =h+S > IU=U)(ov) =h+ JO)m. (6.5.7) 
7 


In the second part of this equation, we have introduced the average value 


m= (oi), (6.5.8) 
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which is position-independent, owing to the translational invariance of the 
Hamiltonian; thus m refers to the average magnetization per lattice site (per 
spin). Furthermore, we use the abbreviation 


J=5(0) =o Jl) (6.5.9) 
l 


for the Fourier transform at k = 0 (see (6.5.5’)). Eq. (6.5.7) contains, in 
addition to the external field, the molecular field Jm. The density matrix 
then has the simplified form 


px [leer : 
q 


Formally, we have reduced the problem to that of a paramagnet, where the 
molecular field must still be determined self-consistently from the magneti- 
zation (6.5.8). 

We still want to derive the molecular field approximation, justified above 
with intuitive arguments, in a more formal manner. We start with an ar- 
bitrary interaction term in (6.5.4), —J(l — l’)o1oy, and rewrite it up to a 
prefactor as follows: 


gov = ((o1) +o, — (01)) ((ov) + oy — (ov)) 
= (01) (ov) + (01) (ov — (ov) (6.5.10) 
+ (ov) (or on (01) + (a1 > (01) (ov = (ov)) < 
Here, we have ordered the terms in powers of the deviation from the mean 


value. We now neglect terms which are nonlinear in these fluctuations. This 
yields the following approximate replacements: 


aor — —(o1)(ov) + (or)ov + (or)or , (6.5.10') 


which lead from (6.5.4) to the Hamiltonian in the molecular field approxima- 
tion 
1 ~ 2 
Hurr = 5m N J(0) — S° o(h + J(O)m) . (6.5.11) 
l 


We refer to the Remarks for comments about the validity and admissibility 
of this approximation. With the simplified Hamiltonian (6.5.11), we obtain 
the density matrix 


pup = Zhe ee) are IN| (6.5.12) 
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and the partition function 


Zurr = Ty 8 [1 ou(ht+Jm)— 5m? IN] = II ( S- nit eo 3hm? IN 
I 


o,=t1 


(6.5.13) 


in the molecular field approximation, where Tr = > fap=eit* We thus find 
for (6.5.13) 


, - .\N 
Zurr = (o-b6m"Jp cosh 3(h + Jm)) (6.5.13’) 


Using m = HkTS log Zmrr, we obtain the equation of state in the molecular 
field approximation: 


m = tanh (G(Jm + h)) , (6.5.14) 


which is an implicit equation for m. Compared to the equation of state of 
a paramagnet, the field h is amplified by the internal molecular field Jm. 
As we shall see later, (6.5.14) can be solved analytically for h. It is however 
instructive to solve (6.5.14) first for limiting cases. To do this, it will prove 
expedient to introduce the following abbreviations: 


_T-T, 
=. 


and + (6.5.15) 
We will immediately see that J. has the significance of the transition tem- 
perature, the Curie temperature. Above T,, the magnetization is zero in the 
absence of an applied field; below this temperature, it increases continuously 
with decreasing temperature to a finite value. We first determine the behav- 
ior in the neighborhood of T., where we can expand in terms of 7,h and m. 


a) h=0: 
For zero applied field and in the vicinity of T., (6.5.14) can be expanded in 
a Taylor series, 


ee. pe Ne 
m = tanh BJm = —m s( m) f... (6.5.16) 


which can be cut off at the third order so as to retain the leading term of the 
solution. The solutions of (6.5.16) are 


m=0 for T>T (6.5.17a) 


and 


m=+m, m= V3(—7)/? for T<T. (6.5.17b) 
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The first solution, m = 0, is found for all temperatures, the second only for 
T <T, i.e. 7 < 0. Since the free energy of the second solution is smaller (see 
below and in Fig. 6.9), it is the stable solution below T,. From these consid- 
erations we find the temperature ranges given in (6.5.17). For T < T,, the 
spontaneous magnetization, denoted as mo, is observed (6.5.17b); it follows 
a square-root law (Fig. 6.5). This quantity is called the order parameter of 
the ferromagnetic phase transition. 


Fig. 6.5. The spontaneous magnetization 
(solid curve), and the magnetization in an 
applied field (dashed). The spontaneous 
magnetization in the Ising model has two 
possible orientations, +mo or —mo 


-Mo 


b) h and 7 nonzero: 
for small h and 7 and thus small m, the expansion of (6.5.14) 


,_r\_ bh _lfh te = 
a To he? (BRE Ey Ss 


leads to the magnetic equation of state 


h zs 1 5 
=TmM+= 
kT. a 
in the neighborhood of T,. An applied magnetic field produces a finite magne- 
tization even above T, and leads qualitatively to the dashed curve in Fig. 6.5. 


(6.5.18) 


c)T=0: 
exactly at T., we find from (6.5.18) the critical isotherm: 
gh \ 1/3 ; 
— ~N 4 . 1 
m (=) , hem (6.5.19) 


d) Susceptibility for small r: 
we now compute the isothermal magnetic susceptibility y = (St) by dif- 
ferentiating the equation of state (6.5.18) with respect to h 


iT. TX +m? . (6.5.20) 


In the limit h — 0, we can insert the spontaneous magnetization (6.5.17) into 
(6.5.20) and obtain for the isothermal magnetic susceptibility 
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X kTe 


susceptibility 


Fig. 6.6. The magnetic suscep- 
tibility (6.5.21): the Curie-Weiss 


law 


_1/k Pee 
1/kT. (p=T) 
es S : (6.5.21) 
2 
T+m 1/k 
———— T<T 
XT. —T) 


this is the Curie-Weiss law shown in Fig. 6.6. 

Remark: We can understand the divergent susceptibility at T, by starting from 

the Curie law for paramagnetic spins (6.3.10a), adding the internal molecular field 

Jm to the field h, and then determining the magnetization from it: 

ae 
kT 


L/k 


m = 
ee ee 


(h+ Jm) (6.5.22) 


Following these limiting cases, we solve (6.5.14) generally. We first discuss 
the graphical solution of this equation, referring to Fig. 6.7. 

e) A graphical solution of the equation m = tanh((h + Jm)) 

To find a graphical solution, it is expedient to introduce the auxiliary variable 
y=m+ ae Then one finds m as a function of h by determining the 
intersection of the line y— 74 with tanh4y: 


m=y 


Te 
~ EE = tanh7y : 
For T > T., Fig. 6.7a exhibits exactly one intersection for each value of 
h. This yields the monotonically varying curve for T > T, in Fig. 6.8. For 
T < IT, from Fig. 6.7b the slope of tanhfzy at y = 0 is greater than 1 and 
therefore we find three intersections for small absolute values of h, while the 
solution for high fields remains unique. This leads to the function for T < T, 
which is shown in Fig. 6.8. 
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m h<0 h=0 hoo 
4s ff ‘ 


(a) (b) 


Fig. 6.8. The magnetic 
equation of state in the 
molecular field approxi- 
mation (6.5.23). The dot- 
ted vertical line on the m- 
axis represents the inho- 
mogeneous state (6.5.28) 


For small h, m(h) is not uniquely determined. Particularly noticeable is the 
fact that the S-shaped curve m(h) contains a section with negative slope, 
i.e. negative susceptibility. In order to clarify the stability of the solution, we 
need to consider the free energy. We first note that for large h, the magneti- 
zation approaches its saturation value (Fig. 6.7). 

In fact, one can immediately compute the function h(m) from Eq. (6.5.14) 
analytically, since from 


1l+m 
l1—m 


f 1 
B(Jm +h) = arctanhm = 5 log 


the equation of state 


Ec 1 
h= —-kT.m4 a log as 


follows. Its shape is shown in Fig. 6.8 for T < T, at the two values T = 0.8 T, 
and 1.27, taken as examples, in agreement with the graphical construction. 


(6.5.23) 
l1—m 
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As mentioned above, for a given field h, the value of the magnetization below 
T, is not everywhere unique; e.g. for h = 0, the three values 0 and +mo occur. 
In order to find out which parts of the equation of state are physically stable, 
we must investigate the free energy. 

The free energy in the molecular field approximation, F = —kT log Zyrr, 
per lattice site and in units of the Boltzmann constant, is given from (6.5.13’) 
by 


f(T, h) = = = sTem? — T log {2 cosh((T.m + h/k)/T)} ar 
, 5 
a 


Dd 
~ 


Te 
T—T,)m? 4 ml mh/k — Tlog2. 
We give here in the first line the complete expression, and in the second line 
the expansion in terms of m, h and T —T., which applies in the neighborhood 
of the phase transition. Here, m = m(h) must still be inserted. 
From (6.5.24), the heat capacity at vanishing applied field (for T = T.) 
can be found: 
OF 
Ch=0 = —NkT OT? 


h=0 


. iis! a 
3 T , 
gNkr T<T 


here, a jump of magnitude Acpio = 3N k is seen. We calculate directly the 
Helmholtz free energy 


a(T,m) = f +mh/k 
(6.5.25) 


= 5Tem? — T log {2cosh((T.m + h/k)/T)} + mh/k , 


in which h = h(m) is to be inserted. From the determining equation for m 
(6.5.14), it follows that 


T log {2 cosh((Tpm + h/k)/T)} = 


i 1/2 
= Tlog2+ T log (; — tank? (Tam + aan) 
= Tlog2— 5 loa( —m?’). 
Combining this with (6.5.23) and inserting into (6.5.25), we obtain 


1 1 T 1 
a(T,m) = —5Tem* — Tlog2 + 5 Ff lost m?) + > log walt 


Ba (6596) 


1 
x —Tlog2+ =(T. —T)m? 4 4. 
og 2+ 5(Lo—T)m? + mt; 
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Fig. 6.9. The Helmholtz 
free energy in the molec- 
ular field approximation 
above and below Ty, for 
T = 08T, and T = 
1.2T¢. 


here, the second line holds near T,. The Helmholtz free energy above and 
below T,. is shown in Fig. 6.9. 

We first wish to point out the similarity of the free energy for T < T, with 
that of the van der Waals gas. For temperatures T < T,, there is a region 
in a(T,m) which violates the stability criterion (6.1.24a). The magnetization 
can be read off from Fig. 6.9 using 


bak (5 (6.5.27) 
Om) 


by drawing a tangent with the slope h to the function a(T,m). Above T., 
this construction gives a unique answer; below 7, however, it is unique only 
for a sufficiently strong applied field. We continue the discussion of the low- 
temperature phase and determine the reorientation of the magnetization on 
changing the direction of the applied magnetic field, starting with a magnetic 
field h for which only a single value of the magnetization results from the 
tangent construction. Lowering the field causes m to decrease until at h = 0, 
the value mg is obtained. Exactly the same tangent, namely that with slope 
zero, applies to the point —mpo. Regions of magnetization mo and —mpo can 
therefore be present in equilibrium with each another. When a fraction c of the 
body has the magnetization —mp and a fraction 1 — c has the magnetization 
mo, then for 0 < c < 1 the average magnetization is 


m = —cmo + (1 — c)mo = (1 — 2c)mo (6.5.28) 


in the interval between —mpo and mo. 
The free energy of this inhomogeneously magnetized object is a(mo) (dot- 
ted line in Fig. 6.9), and is thus lower than the part of the molecular-field so- 
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lution which arches upwards and which corresponds to a homogeneous state 
in the coexistence region of the two states +mo and —mpo. In the inter- 
val [—mo, mo], the system does not enter the homogeneous state with its 
higher free energy, but instead breaks up into domains?* which according to 
Eq. (6.5.28) yield all together the magnetization m. We remind the reader 
of the analogy to the Maxwell construction in the case of a van der Waals 
liquid. 

For completeness, we compare the free energies of the magnetization states 
belonging to a small but nonzero h. Without loss of generality we can assume 
that h is positive. Along with the positive magnetization, for small h there are 
also two solutions of (6.5.27) with negative magnetizations. It is clear from 
Fig. 6.9 that the latter two have higher free energies than the solution with 
positive magnetization. For a positive (negative) magnetic field, the state 
with positive (negative) magnetization is thermodynamically stable. The S- 
shaped part of the equation of state (for T < T,) in Fig. 6.8 is thus replaced 
by the dotted vertical line. 

Finally, we give the entropy in the molecular field approximation: 


Ss Oa 1l+m 1l+m l1-m l—m 
i ae (FF) | 5 log pe log 5 ; (6.5.29) 


m 


it depends only on the average magnetization m. 
The internal energy is given by 


E ; _ ol 2 
e= aR 78 mh/k+Ts = 5 Tem” —mh/k. (6.5.30) 


This can be more readily seen from (6.5.11) by taking an average value (KH) 


with the density matrix (6.5.12). From h = p2athm) it again follows that 


m= tanhemtn/k i.e. we recover Eq. (6.5.14). 


Remarks: 


(i) The molecular field approximation can also be applied to other models, 
for example the Heisenberg model, and also for quite different cooperative 
phenomena. The results are completely analogous. 

(ii) The effect of the remaining spins on an arbitrarily chosen spin is replaced 
in molecular field theory be a mean field. In the case of a short-range inter- 
action, the real field configuration will deviate considerably from this mean 
value. The more long-ranged the interaction, the more spins contribute to 
the local field, and the more closely it thus approaches the average field. The 


24 The number of domains can be greater than just two. When there are only 
a few domains, the interface energy is negligible in comparison to the gain in 
volume energy; see problem 7.6. In reality, the dipole interaction, anisotropies 
and inhomogeneities in the crystal play a role in the formation of domains. 
They form in such a way that the energy including that of the magnetic field is 
minimized. 
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molecular field approximation is therefore exact in the limit of long-range 
interactions (see also problem 6.13, the Weiss model). We note here the anal- 
ogy between the molecular field theory and the Hartree-Fock theory of atoms 
and other many-body systems. 

(iii) We want to point out another aspect of the molecular field approxima- 
tion: its results do not depend at all on the dimensionality. This contradicts 
intuition and also exact calculations. In the case of short-range interactions, 
one-dimensional systems in fact do not undergo a phase transition; there are 
too few neighbors to lead to a cooperative ordering phenomenon. 

(iv) In the next chapter, we shall turn to a detailed comparison of the gas- 
liquid transition and the ferromagnetic transition. We point out here in antic- 
ipation that the van der Waals liquid and the ferromagnet show quite similar 
behavior in the immediate vicinity of their critical points in the molecular 
field approximation; e.g. (pg — pe) ~ (Te — fas and Mp ~ (T. — ioe and 
likewise, the isothermal compressibility and the magnetic susceptibility both 
diverge as (T, — T)~‘. This similarity is not surprising; in both cases, the 
interactions with the other gas atoms or spins is replaced by a mean field 
which is determined self-consistently from the ensuing equation of state. 

(v) If one compares the critical power laws (6.5.17), (6.5.19), and (6.5.21) 
with experiments, with the exact solution of the two-dimensional Ising model, 
and with numerical results from computer simulations or series expansions, 
it is found that in fact qualitatively similar power laws hold, but the critical 
exponents are different from those found in the molecular field theory. The 
lower the dimensionality, the greater the deviations found. Instead of (6.5.17), 
(6.5.19), and (6.5.21), one finds generalized power laws: 


mo ~|7|? Pe. (6.5.31a) 
mwah? T=To, (6.5.31b) 
x ~|r|7 T2T, (6.5.31c) 
ch ~ |r| ° Top. (6.5.31) 


The critical exponents 3,6, y and a which occur in these expressions in general 
differ from their molecular field values 1/2,3,1 and 0 (corresponding to the 
jump). For instance, in the two-dimensional Ising model, 3 = 1/8,6 = 15,7 
7/4, and a = 0 (logarithmic). 

Remarkably, the values of the critical exponents do not depend on the lattice 
structure, but only on the dimensionality of the system. All Ising systems 
with short-range forces have the same critical exponents in d dimensions. 
Here, we have an example of the so called universality. The critical behavior 
depends on only a very few quantities, such as the dimensionality of the 
system, the number of components of the order parameter and the symmetry 
of the Hamiltonian. Heisenberg ferromagnets have different critical exponents 
from Ising ferromagnets, but within these groups, they are all the same. With 
these remarks about the actual behavior in the neighborhood of a critical 
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point, we will close the discussion. In particular, we postpone the description 
of additional analogies between phase transitions to the next chapter. We 
now return to the molecular field approximation and use it to compute the 
magnetic susceptibility and the position-dependent spin correlation function. 


6.5.3 Correlation Functions and Susceptibility 


In this subsection, we shall consider the Ising model in the presence of a 
spatially varying applied magnetic field h;. The Hamiltonian is then given by 


KH = Ho oes = a8 J(L=1) oon Soa (6.5.32) 
The magnetization per spin at position 1 now depends on the lattice site /: 

m= (01) =Tr [e &* oj] /Tr ee P™ (6.5.33) 
We first define the susceptibility 


Om, 


ap 6.5.34 
Ohy ’ ( ) 


X (x1, XV) = 


which describes the response at the site / to a change in the field at the site I’. 
The correlation function is defined as 


G (x1, xv) = (a10v) — (01) ov") 
= ((01 — (a1) (ov — (or))) - (6.5.35) 


The correlation function (6.5.35) is a measure of how strongly the deviations 
from the mean values at the sites | and I’ are correlated with each other. 
Susceptibility and correlation function are related through the important 
fluctuation-response theorem 


1 
(X10) = Ga G (x1, x0’) - (6.5.36) 
This theorem (6.5.36) can be derived by taking the derivative of (6.5.33) with 


respect to hy. 
For a translationally invariant system, we have 


X(X1, Xv) |n,=0} = x(x1— xv) and G(x, Xv )|pn,=0} = G(x; — xy) . 
(6.5.37) 


At small fields hj, we find (m} = mj — m) 


m= Sox (x1 — xv) he « (6.5.38) 
U 
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A periodic field 
hy = hae (6.5.39) 


therefore gives rise to a magnetization of the form 


mj = EPS © y (x) — xy) OI hg = x(q) e™! hg , (6.5.40) 
UV 
where 
1 
foe _ —iq(xi—x),/) _ 7 —iqx 
x(a) = xu xy )e AVN) = Ep 6 (i) & oe (6.5.41) 


is the Fourier transform of the susceptibility, and following the equals sign 
(6.5.36) has been inserted. In particular for q = 0, we find the following 
relation between the uniform susceptibility and the correlation function: 


x =x (0) = Gx). (6.5.42) 
l 


Since the correlation function (6.5.35) can never be greater than 1, (|o1| = 
1), and is in no case divergent, the divergence of the uniform susceptibility, 
Eq. (6.5.21) (i.e. the susceptibility referred to a spatially uniform field) can 
only be due to the fact that the correlations at T, attain an infinitely long 
range. 


6.5.4 The Ornstein—Zernike Correlation Function 


We now want to calculate the correlation function introduced in the previous 
section within the molecular field approximation. As before, we denote the 
field by hy, so that the mean value m; = (0;) is also site dependent. In the 
molecular-field approximation, the density matrix is given by 


pmet = Z~* exp Dy oi(hi +S. J(l- (or) ; (6.5.43) 
I ie 


The Fourier transform of the exchange coupling, which we take to be short- 
ranged, can be written for small wavenumbers as 


3 : Z 1 . 
J(k) = So (De & Jk? - So xf) = Jk. (6.5.44) 
l l 


Here, we have replaced the exponential function by its Taylor series. Due to 
the mirror symmetry of a cubic lattice, J (—1) = J (I), and therefore there is 


no linear term in k. Furthermore, we have > (k - xy)” J (I) = $k? 2 x?J (I). 
i i 
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The constant J is defined by 
1 2 
J== » x7 J (1). (6.5.45) 


Using the density matrix (6.5.43), we obtain for the mean value of o;, in 
analogy to (6.5.14) in Sect. 6.5.2, the result 


(or) = tanh [Bu + > IL-0) (or))] . (6.5.46) 
: 


We now take the derivative Bh of the last equation (6.5.46),and finally set 


all the hy = 0, obtaining for the susceptibility: 
1 
7 x 
cosh [Bw J(l si Wm] 
x (35 + BSI =U") x(x - x1’)) 


dd 


x(x — XV) = 
(6.5.47) 


The Fourier-transformed susceptibility (6.5.41) is obtained from (6.5.47), re- 
calling the convolution theorem: 


1 


Cerra 


(3+ 8J(a)x(a)) « (6.5.48) 


een = oe. where we have in- 


serted the determining equation for m, Eq. (6.5.16), we obtain the general 
result 


Furthermore, using cosh” BJIm = 


x(a) = =< G (6.5.49) 


From this last equation, together with (6.5.15) and (6.5.44), we find in the 
neighborhood of Ti: 


x (a) = Z 


7 Te 2, Jq@ 
l—- +m + FF 


for TT, (6.5.50) 


or also 


1 


x(q) = Tiere)’ (6.5.50’) 


where the correlation length 


JT\? (7-12 TT 
= (— 6.5.51 
é (Gr) eee T<T. ( ) 
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has been introduced, with rt = (T — T.) /T.. The susceptibility in real space 
is obtained by inverting the Fourier transform: 


1 . V . 
=e e = iq(x1—x)1) _ fig ig(x1 >) 
xO0—xr) =~ Do x(abe wasp | Paxtae 
? (6.5.52) 
For the second equals sign it was assumed that the system is macroscopic, so 
that the sum over q can be replaced by an integral (cf. (4.1.2b) and (4.1.14a) 
with p/h q) . 

To compute the susceptibility for large distances it suffices to make use of 
the result for y(q) at small values of q (Eq. (6.5.50’)); then with the lattice 
constant a we find 

ela —x)/) 


a® fe = 
Qn J OTT +E*) ~ An Jixr— xr] | 


From y calculated in this way, we find the correlation function via (6.5.37): 


ase7l*i— Xu! I/é 


x (x1 — xy) = (6.5.53) 


kT a3enP*l/é ; 
G (x) = kT x (x) = ee Gre (6.5.53) 
which in this context is called the Ornstein—Zernike correlation function. The 
Ornstein—Zernike correlation function and its Fourier transform are shown in 
Fig. 6.10 and Fig. 6.11 for the temperatures T = 1.01 T, and T = T,. In these 
figures, the correlation length € at T = 1.01 JT, is also indicated. The quantity 
& is defined by € = (J/kT.)/?, according to (6.5.48). At large distances y(x) 
1 


decreases exponentially as ibe e—!*I/§. The correlation length € characterizes 


%(x)kT, 107 X (a)kT, 


120 


3 ‘“ \ 
2 
wo] Te 
0 ~ 
0 
5 9 6s 20 x/Eq 0 ‘ = = 
S/8o di 0285 o“gs! 9 


Fig. 6.10. The Ornstein—Zernike cor- 
relation function for T’ = 1.01 T. and 


for T = T.. Distances are measured in 
units of & = (J/kTe)'/?. 


Fig. 6.11. The Fourier transform of 
the Ornstein—Zernike susceptibility for 
T = 1.01 T. and for T = T.. The re- 
ciprocal of the correlation length for 
T = 1.01 T. is indicated by the arrow. 
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the typical length over which the spin fluctuations are correlated. For |x| > €, 
G (x) is practically zero. At T., € = oo, and G(x) obeys the power law 


kT.u 


CO) An J |x| 


(6.5.54) 


with the volume of the unit cell v = a3. y(q) varies as 1/q? for €-' < q and 
for q = 0, it is identical with the Curie-Weiss susceptibility. On approaching 
T., x (0) becomes larger and larger. We note further that the continuum 
theory and thus (6.5.50’) and (6.5.52) apply only to the case when |x| > a. 
An important experimental tool for the investigation of magnetic phenom- 
ena is neutron scattering. The magnetic moment of the neutron interacts 
with the field produced by the magnetic moments in the solid and is there- 
fore sensitive to magnetic structure and to static and dynamic fluctuations. 
The elastic scattering cross-section is proportional to the static susceptibil- 
ity x(q). Here, q is the momentum transfer, q = kin — kout, where kin (out) 
are the wave numbers of the incident and scattered neutrons. The increase 
of x(q) at small q for T — T, leads to intense forward scattering. This is 
termed critical opalescence near the Curie temperature, in analogy to the 
corresponding phenomenon in light scattering near the critical point of the 
gas-liquid transition. 

The correlation length € diverges at the critical point; the correlations 
become more and more long-ranged as T;, is approached. Therefore, statistical 
fluctuations of the magnetic moments are correlated with each other over 
larger and larger regions. Furthermore, a field acting at the position x induces 
a polarization not only at that position, but also up to a distance €, as a result 
of (6.5.37). The increase of the correlations can also be recognized in the spin 
configurations illustrated in Fig. 6.12. Here, ‘snapshots’ from a computer 
simulation of the Ising model are shown. White pixels represent o = +1 and 
black pixels are for o = —1. At twice the transition temperature, the spins 
are correlated only over very short distances (of a few lattice constants). 
At T = 1.17%, the increase of the correlation length is clearly recognizable. 


Fig. 6.12. A ‘snapshot’ of the spin configuration of a two-dimensional Ising model 
at T = 27.,T = 1.17. and T = T.. White pixels represent o = +1, and black 
pixels refer to 0 = —1. 
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Along with very small clusters, both the black and the white clusters can 
be made out up to the correlation length €(T = 1.17.). At T = T., € = oo. 
In the figure, one sees two large white and black clusters. If the area viewed 
were to be enlarged, it would become clear that these are themselves located 
within an even larger cluster, which itself is only a member of a still larger 
cluster. There are thus correlated regions on all length scales. We observe 
here a scale invariance to which we shall return later. When we enlarge the 
unit of length, the larger clusters become smaller clusters, but since there are 
clusters up to infinitely large dimensions, the picture remains the same. 
The Ornstein—Zernike theory (6.5.51) and (6.5.53) reproduces the correct 
behavior qualitatively. The correlation length diverges however in reality as 
€ = oT”, where in general v # 4, and also the shape of G(x) differs from 
(6.5.53’) (see Chap. 7). 


*6.5.5 Continuum Representation 


6.5.5.1 Correlation Functions and Susceptibilities 


It is instructive to derive the results obtained in the preceding sections in a contin- 
uum representation. The formulas which occur in this derivation will also allow a 
direct comparison with the Ginzburg-Landau theory, which we will treat later (in 
Chap. 7). Critical anomalies occur at large wavelengths. In order to describe this 
region, it is sufficient and expedient to go to a continuum formulation: 


hi > h(x) , 01 > o (x) ,mi > M(x) , 


So hia = | Fawow . (6.5.55) 


Here, a is the lattice constant and v = a? is the volume of the unit cell. The sum over 
1 becomes an integral over x in the limit v — oo. The partial derivative becomes a 
functional derivative?’ (v — 0) 


dm (x) _ 1 Om oh (x) 


tC., e.g. Ge) =6(x—-x’). (6.5.56) 


éh(x') v Ohy 


For the susceptibility and correlation function we thus obtain from (6.5.34) 


(Gt dm(x) Om _ 1 
x dh(x’)  Ohy kT 


G(x-x’). (6.5.57) 


For small h (x), we find 


3! 
m (x) = “ x (x-—x’) h(x’) . (6.5.58) 


25 The general definition of the functional derivative is to be found in W. I. Smirnov, 
A Course of Higher Mathematics, Vol. V, Pergamon Press, Oxford 1964 or in 
QM I, Sect. 13.3.1 
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A periodic field 


h(x’) = hq ei (6.5.59) 
induces a magnetization of the form 
i d® ' -i —x’ iqx 
m (x) = dm f = x (x —x')e ae) ha =x(q)e™ ha, (6.5.60) 
where 
d’y —iay 1 3) mi 
= /— “ay = —_ Jd pod 5.61 
x (a) / oe) om! ye ™ Gly) (6.5.61) 


is the Fourier transform of the susceptibility, and after the second equals sign, we 
have made use of (6.5.37). In particular, for q = 0, we find the following relation 
between the uniform susceptibility and the correlation function: 


X=x0)= Ff Py Gy). (6.5.62) 


6.5.5.2 The Ornstein—Zernike Correlation Function 
As before, the field h (x) and with it also the mean value (o(x)) are position depen- 


dent. The density matrix in the molecular field approximation and in the continuum 
representation is given by: 


purr = Z~* exp E / To (x) (no) + i TY 3 (xx!) (9 («')))] . (6.5.63) 


The Fourier transform of the exchange coupling for small wavenumbers assumes 
the form 


P= 3 re ww 3 me 
J (k) = [ S30 go Rad sk? f OF 2 7(x) = FBS, (6.5.64) 


Vv 6 Vv 


where the exponential function has been replaced by its Taylor expansion. Owing 
to the spherical symmetry of the exchange interaction J(x) = J(|x|), there is no 
linear term in k and we find f d*x (kx)? J (x) = 4k? [ d®x x?J (x). The constant 
J is defined by J = x f d°x x? J (x). The inverse transform of (6.5.64) yields 


I(x) =v (J ao Jv?) 5(x) . (6.5.65) 
For phenomena at small k or large distances, the real position dependence of the 
exchange interaction can be replaced by (6.5.65). We insert this into (6.5.63) and 
obtain the mean value of o (x), analogously to (6.5.14) in Sect. 6.5.2: 


(o (x)) = tanh [3 G (x) + F(a (x)) + JV?{o(x))) | (6.5.66) 


In the neighborhood of T,, we can carry out an expansion similar to that in (6.5.16), 


m (x) = (0 (x)), 


(6.5.67) 
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with 7 = (T — T.) /T-, where the second term on the left-hand side occurs due to the 
spatial inhomogeneity of the magnetization. The equations of the continuum limit 
can be obtained from the corresponding equations of the discrete representation 
at any step, e.g. (6.5.67) follows from (6.5.46), by carrying out the substitutions 
(ou) = ma + m (x), J () + I(x) = (F + JV?) 5 (x). 


Now we take the functional derivative They of the last equation, (6.5.67), 


TO V+ mi x (x — x’) = v6 (x — x’) /kTe . (6.5.68) 


Since the susceptibility is calculated in the limit h — 0, the spontaneous magne- 
tization mo, which is given by the molecular-field expressions (6.5.17a,b), appears 
on the left side. The solution of this differential equation, which also occurs in 
connection with the Yukawa potential, is given in three dimensions by 


ver |/é 


/ et 
x (x — x’) ae (6.5.69) 
The Fourier transform is 
1 

= —____ 6.5.70 
x (a) Tie +e?) ( ) 

In this expression, we have introduced the correlation length: 

PANNE ee T> Te 

=|— 6.5.71 
6 (x) (ager ee ( ) 


The results thus obtained agree with those of the previous section; for their discus- 
sion, we refer to that section. 


*6.6 The Dipole Interaction, Shape Dependence, Internal 
and External Fields 


6.6.1 The Hamiltonian 


In this section, we investigate the influence of the dipole interaction. The 
total Hamiltonian for the magnetic moments jz, is given by 


H = Ho({mn}) + Ha({wr}) — $0 aH - (6.6.1) 
i 


Ho contains the exchange interaction between the magnetic moments and 
Ha represents the dipole interaction 


1 
Ha = 5 Ane oe ey 
LU 


(6.6.2) 
1 Sa 3(x1 — Xv), (%1 — Xr’) - 
od ( p p by Me ’ 
LU 


xx — xp? x1 — xp” 


and H, is the externally applied magnetic field. The dipole interaction is long- 
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ranged, in contrast to the exchange interaction; it decreases as the third power 
of the distance. Although the dipole interaction is in general considerably 
weaker than the exchange interaction — its interaction energy corresponds to 
a temperature of about”® 1 K — it plays an important role for some phenomena 
due to its long range and also due to its anisotropy. 

The goal of this section is to obtain predictions about the free energy and 
its derivatives for the Hamiltonian (6.6.1), 


F(T, Ha) = —kT log Tr e~*4/*F (6.6.3) 


and to analyze the modifications which result from including the dipole inter- 
action. Before we turn to the microscopic theory, we wish to derive some ele- 
mentary consequences of classical magnetostatics for thermodynamics; their 
justification within the framework of statistical mechanics will be given at 
the end of this section. 


6.6.2 Thermodynamics and Magnetostatics 


6.6.2.1 The Demagnetizing Field 


It is well known from electrodynamics?’ (magnetostatics) that in a magne- 


tized body, in addition to the externally applied field Ha, there is a demagne- 
tizing field Hg which results from the dipole fields of the individual magnetic 
moments, so that the effective field in the interior of the magnet, H;, 


H; =H.+H,, (6.6.4a) 


is in general different from H,. The field Hg is uniform only in ellipsoids and 
their limiting shapes, and we will thus limit ourselves as usual to this type of 
bodies. For ellipsoids, the demagnetizing field has the form Hg = —DM and 
thus the (macroscopic) field in the interior of the body is 


H;=H,-DM. (6.6.4b) 


Here, D is the demagnetizing tensor and M is the magnetization (per unit 
volume). When H, is applied along one of the principal axes, D can be 
interpreted as the appropriate demagnetizing factor in Eq. (6.6.4b). For Ha 
and therefore M parallel to the axis of a long cylindrical body, D = 0; for Ha 
and M perpendicular to an infinitely extended thin sheet, D = 47; and for a 
sphere, D = a The value of the internal field thus depends on the shape of 
the sample and the direction of the applied field. 


°6 See e.g. the estimate in N. W. Ashcroft and N.D. Mermin, Solid State Physics, 
Holt, Rinehart and Winston, New York, 1976, p. 673. 

27 A. Sommerfeld, Electrodynamics, Academic Press, New York 1952; R. Becker 
and F. Sauter, Theorie der Elektrizitdt, Vol. 1, 21st Edition, p. 52, Teubner, 
Stuttgart, 1973; R. Becker, Electromagnetic Fields and Interactions, Blaisdell, 
1964; J.D. Jackson, Classical Electrodynamics, 2nd edition, John Wiley, New 
York, 1975. 
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6.6.2.2 Magnetic Susceptibilities 


We now need to distinguish between the susceptibility relative to the applied 
field, ya(Ha) = or, and the susceptibility relative to the internal field, 
xi( Ai) = ou. We consider for the moment only fields in the direction of 
the principal axes, so that we do not need to take the tensor character of 
the susceptibilities into account. We emphasize that the usual definition in 
electrodynamics is the second one. This is due to the fact that y;(H;) is a 
pure materials property”®, and that owing to curl H; = *5j, the field H; can 
be controlled in the core of a coil by varying the current density j. 

Taking the derivative of Eq. (6.6.4b) with respect to M, one obtains the 
relation between the two susceptibilities: 


‘ee 
Xi(Hi) — Xa( Ha) 
It is physically clear that the susceptibility y;(H;) relative to the internal 
field H; acting in the interior of the body is a specific materials parameter 


which is independent of the shape, and that therefore the shape dependence 
of Xa (Ha) 


D. (6.6.5a) 


Xa(Ha) = eee (6.6.5b) 


results form the occurrence of D in (6.6.5b) and (6.6.4b).?9 

If the field is not applied along one of the principal axes of the ellipsoid, 
one can derive the tensor relation by taking the derivative of the component 
a of (6.6.4b) with respect to Mz: 


Oe es OG ge ae (6.6.5) 


Relations of the type (6.6.5a—c) can be found in the classical thermodynamic 
literature.°° 


?8 In the literature on magnetism, xi(H;) is called the true susceptibility and 
Xa(Ha) the apparent susceptibility. E. Kneller, Ferromagnetismus, Springer, 
Berlin, 1962, p. 97. 

29 When x; < 1074, as in many practical situations, the demagnetization correction 
can be neglected. On the other hand, there are also cases in which the shape of 
the object can become important. In paramagnetic salts, x; increases at low 
temperatures according to Curie’s law, and it can become of the order of 1; in 
superconductors, 47x; = —1 (perfect diamagnetism or Meissner effect). 

30 R. Becker and W. Doring, Ferromagnetismus, Springer, Berlin, 1939, p. 8; 
A.B. Pippard, Elements of Classical Thermodynamics, Cambridge at the Uni- 
versity Press 1964, p. 66. 
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6.6.2.3 Free Energies and Specific Heats 
Starting from the free energy F(T, H,) with the differential 


dF = —SdT — VMdH, , (6.6.6) 
we can define a new free energy by means of a Legendre transformation 
F(T,H;) = F(T,H,) + i MD? M® ; (6.6.7a) 


The differential of this free energy is, using (6.6.4b), given by 
dF (T,H;) = —SdT — VMdH,; . (6.6.7b) 


Since the entropy 5(T,H;) and the magnetization M(7,H;) as functions 
of the internal field must be independent of the shape of the sample, all 
the derivatives of F (T,H;) are shape independent. Therefore, the free energy 
F(T, Hj) is itself shape independent. From(6.6.6) and (6.6.7b), it follows that 


OF OF 
Gc (=) 2 -($) (6.6.8) 
OD) OT) py, 
and 
1 (OF 1 (OF 
a = , (6.6.9) 
V\OH./ + V\ OH.) p 
The specific heat can also be defined for a constant internal field 
T (OS 
= =(— 6.1 
Cu; V ($7) e (6.6.10a) 
and for a constant applied (external) field 
T (OS 
Cra=s=(=—= ; 6.6.10b 
a (a (sr), ( ) 
Using the Jacobian as in Sect. 3.2.4, one can readily obtain the following relations 
1 
6.11 
Ci, = Cn TR (6.6.11a) 
(2M D (2%) 
oT in oT) H 
a tT 2 ~ .6.11b 
Cu, = Cu, [a Dye (6.6 ) 
and 
(2) D (2) 
T ) Hr; oT) H 
= -T - 6.11 
Cu, = Cu, cme ; (6.6.11c) 


where the index T indicates the isothermal susceptibility. The shape independence 
of xi(H;) and F (T, H;), which is plausible for the physical reasons given above, has 
also been derived using perturbation-theoretical methods.*! For a vanishingly small 
field, the shape-independence could be proven without resorting to perturbation 
theory. 


3! P.M. Levy, Phys. Rev. 170, 595 (1968); H. Horner, Phys. Rev. 172, 535 (1968) 
°° R.B. Griffiths, Phys. Rev.176, 655 (1968) 
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6.6.2.4 The Local Field 


Along with the internal field, one occasionally also requires the local field 
joc. It is the field present at the position of a magnetic moment. One obtains 
it by imagining a sphere to be centered on the lattice site under consideration, 
which is large compared to the unit cell but small compared to the overall 
ellipsoid (see Fig. 6.13). We obtain for the local field:?” 


Moc = Ay + eM (6.6.12a) 
4 
with ¢=¢0+ > =a: (6.6.12b) 


Here, ¢o is the sum of the dipole fields of the average moments within the 
fictitious sphere. The medium outside the imaginary sphere can be treated as 
a continuum, and its contribution is that of a solid polarized ellipsoid (—D), 
minus that of a polarized sphere (42). For a cubic lattice, ¢@9 vanishes for 
reasons of symmetry.?” One can also introduce a free energy 


A 1 
F(T, Hioc) = F(L, Ha) — 5VM¢M (6.6.13a) 
with the differential 

dF = —SdT — VM dH ox . (6.6.13b) 


Since, owing to (6.6.12a,b), (6.6.7a), and (6.6.13a), it follows that 


A 7 1 4 
F(L, Hioc) = F(T, Hi) + 5VM (< iH =) M, (6.6.14) 


so that F differs from F only by a term which is independent of the exter- 
nal shape and is itself therefore shape-independent. One can naturally also 


g 
( 


Fig. 6.13. The definition of the local field. An ellipsoid of 
volume V and a fictitious sphere of volume Vo (schematic, 
not to scale) 
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define susceptibilities at constant H},, and derive relations corresponding to 
equations (6.6.13a-c) and (6.6.11a-c), in which essentially H; is replaced by 
Moc and D by @. 


6.6.3 Statistical-Mechanical Justification 


In this subsection, we will give a microscopic justification of the thermo- 
dynamic results obtained in the preceding section and derive Hamiltoni- 
ans for the calculation of the shape-independent free energies F(T, H;) and 


F(T, Hoc) of equations (6.6.7a) and (6.6.13a). The magnetic moments will 
be represented by the their mean values and fluctuations. The dipole inter- 
action will be decomposed into a short-range and a long-range part. For the 
interactions of the fluctuations, the long-range part can be neglected. The 
starting point will be the Hamiltonian (6.6.1), in which we introduced the 
fluctuations around (deviations from) the mean value (1?) 


buy = uy — (uy) : (6.6.15) 


H = Ho({mit) + Sa our ig PS 5 Ai P) ui) 


Ul’ Ul’ 


ie An? Ou (Hp) es wp He 


Lv 
Ho({mi}) + 29 Au? uy bub — Lar ) ui") 
LU Lv 


— So up (Ae + HG) (6.6.16) 
l 


with the thermal average of the field at the lattice point / due to the remaining 
dipoles: 


Hi, =— > Agr (ui) - (6.6.17) 
it 
For ellipsoids in an external magnetic field, the magnetization is uniform, 


(Cult = 4M BP); likewise the dipole field (demagnetizing field): 


HY, = He. = (¢0 + Do- D)agM? . (6.6.18) 


ap 
In going from (6.6.17) to (6.6.18), the dipole sum 


bag = — oe Aa? = (¢0 + Do — D) ag (6.6.19) 
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was decomposed into a discrete sum over the subvolume Vo (the Lorentz 
sphere) and the region V — Vo, in which a continuum approximation can be 
applied: 


a0a41 
—y =~ 3 ow 
(Do - P)ag I . ” Oza Ox, |x| 


=bo0(f fp Ban a ae 2). 


dea |x| Js, °° xg |x| 


(6.6.20) 


The first surface integral ee over the surface of the Lorentz sphere and 
the second over the (external) surface of the ellipsoid (sample). 
With this, we can write the Hamiltonian in the form 


H = Ho( ({u}) + en buy Ouly — Hi Hit 5 5VMa bapMsg . 
U3 


(6.6.21) 


Since the long-range property of the dipole interaction plays no role in the in- 
teraction between the fluctuations 6,1;, the first two terms in the Hamiltonian 
are shape-independent. The sample shape enters only in the local field Ajo. 
and in the fourth term on the right-hand side. Comparison with (6.6.13a) 


shows that the free energy F (T, Mioc), which, apart from its dependence on 
Hoc, is Shape independent, can be determined by computation of the parti- 
tion function with the first three terms of (6.6.21). 


If the dipole interactions between the fluctuations is completely neglected,®* one 
obtains the approximate effective Hamiltonian 


HK = Hol ({p,}) ~ 2 HH (6.6.22) 


in which the dipole interaction expresses itself only in the demagnetizing field. 
The exact treatment of the second term, 4 >>); AY ous Ou in (6.6.21) 


is carried out as follows: since the expectation value based on approximate 
ent /§ 


application - the Ornstein—Zernike theory decreases as (dd) & ——., 
and Aj ~ = the interaction of the fluctuations is negligible at large dis- 
tances. The eheae of the sample thus plays no role in this term in the limit 
V — oo with the shape kept unchanged. One can thus replace Ae by 


Oo 8 - ex 
eAOP = ; 6.6.23 
e Ox% 0x8 |x| ( ) 
with the cutoff length o~!, or more precisely 
1 HiBine 1 Pate 
5 S- AvP Sut jue = = jim 1 jim 5 S- Ae’ out Jue : (6.6.24) 


LU LU 


33 J. H. van Vleck, J. Chem. Phys. 5, 320, (1937), Eq. (36). 
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Inserting du, = pi — (141), we obtain for the right-hand side of (6.6.24) 


* . 1 TA Qa Qa a 
iim jm, OY ss Aur (ui hr — 27 (Hi) + (uj (ui) ) 
Lv 


= lim lim De SAF ie be a (6.6.25) 


o—0V—00 2 


7 
Bio F M2 
a » (60 + Do)agM” uj 5 (0 + Do) 


In the order: first the thermodynamic limit V — oo, then o — 0, the first 
term in (6.6.25) is shape-independent. Since in the second and third terms, 
the sum over I’ is cut off by e~!*'~*v!?, the contribution —D due to the 
external boundary of the ellipsoid does not appear here. Inserting (6.6.24) 
and (6.6.25) into (6.6.21), we find the Hamiltonian in final form** 


H=H—- <M DM (6.6.26a) 
with 
“8 dq 
it = Ho({u)) + f (ne a ugAat ee « — So upHe . (6.6.26b) 
l 


Here, the Fourier transforms 


1 : 
Me = —_ S- e@ 1E 1) , (6.6.27a) 
acer 
Ash = S- a Gc aaa a (6.6.27b) 
140 


and the internal field H; = H, — DM have been introduced. The Fourier 
transform (6.6.27b) can be evaluated using the Ewald method °°; for cubic 
lattices, it yields®® 


1 (4r 3q%q? 2 
ap _ a a8 2 a aB 
AS -i(4 (se 2 +aig*q + (a24 —as(q")°) 5 


+ O(a4,(a")',(a")"(@")")) | (6.6.27b’) 


where v, is the volume of the primitive unit cell and the a, are constants 
which depend on the lattice structure. The first two terms in H, Eq. (6.6.26b), 


34 See also W. Finger, Physica 90 B, 251 (1977). 

35 P,P. Ewald, Ann. Phys. 54, 57 (1917); ibid., 54, 519 (1917); ibid., 64, 253 (1921) 

36 M.H. Cohen and F. Keffer, Phys. Rev. 99, 1135 (1955); A. Aharony and 
M.E. Fisher, Phys. Rev. B 8, 3323 (1973) 
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are shape-independent. The sample shape enters only into the internal field 
Hj, and in the last term of (6.6.26a). Comparison of Eq. (6.6.26a) with 
Eq. (6.6.7a) shows that the shape-independent free energy F(T, H;) can 
be calculated from the partition function derived from the Hamiltonian KH, 
Eq. (6.6.26b). We note in particular the nonanalytic behavior of the term 
daq3/q° in the limit g — 0; it is caused by the 1/r?-dependence of the dipole 
interaction. Due to this term, the longitudinal and transverse wavenumber- 
dependent susceptibilities (with respect to the wavevector) are different from 
each other.*” We recall that the short-ranged exchange interaction can be 
expanded as a Taylor series in q: 


1 S 
Ho = —5 fa J(qQ) Mglt—q 


J(q) = J - Jq? + O(q'). 


(6.6.28) 


In addition to the effects of the demagnetizing field and the resulting shape 
dependence, which we have treated in detail, the dipole interaction, even 
though it is in general much weaker than the exchange interaction, has a 
number of important consequences owing to its long range and its anisotropic 
character:?” (i) It changes the values of the critical exponents in the neighbor- 
hood of ferromagnetic phase transitions; (ii) it can stabilize magnetic order 
in systems of low dimensionality, which otherwise would not occur due to 
the large thermal fluctuations; (iii) the total magnetic moment pp = D0, by 
is no longer conserved. This has important consequences for the dynamics; 
and (iv) the dipole interaction is important in nuclear magnetism, where it 
is larger than or comparable to the indirect exchange interaction. 

We can now include the dipole interactions in the results of Sects. 6.1 to 
6.5 in the following manner: 


(i) If we neglect the dipole interaction between the fluctuations of the mag- 
netic moments du; = py — (1) aS an approximation, we can take the 
spatially uniform part of the dipole fields into account by replacing the 
field H by the local field Hyoc. 

(ii) If, in addition to the exchange interactions possibly present, we also 
include the dipole interaction between the fluctuations, then accord- 
ing to (6.6.26), the complete Hamiltonian contains the internal field H;. 
The field H must therefore be replaced by H,;; furthermore, the shape- 
dependent term —tMDM enters into the Hamiltonian H, Eq. (6.6.26a), 


and, via the term H, also the shape-independent part of the dipole in- 
teraction, i.e. Eq. (6.6.27b’). 


37 B®. Frey and F. Schwabl, Advances in Physics 43, 577 (1994) 
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6.6.4 Domains 


The spontaneous magnetization per spin, mo(T), is shown in Fig. 6.5. The to- 
tal magnetic moment of a uniformly magnetized sample without an external 
field would be Nmo(T), and its spontaneous magnetization per unit volume 
Mo(T) = Nmo(T)/V, where N is the overall number of magnetic moments. 
In fact, as a rule the magnetic moment is smaller or even zero. This results 
from the fact that a sample in general breaks up into domains with different 
directions of magnetization. Within each domain, |M(x,T)| = Mo(T). Only 
when an external field is applied do the domains which are oriented parallel 
to the field direction grow at the cost of the others, and reorientation oc- 
curs until finally Nmo(T) has been reached. The spontaneous magnetization 
is therefore also called the saturation magnetization. We want to illustrate 
domain formation, making use of two examples. 


(i) One possible domain structure in a ferromagnetic bar below T, is shown 
in Fig. 6.14. One readily sees that for the configuration with 45°-walls 
throughout the sample, 


divM=0. (6.6.29) 
Then it follows from the basic equations of magnetostatics 


div H; = —47 div M (6.6.30a) 
curlH; =0 (6.6.30b) 


that, in the interior of the sample, 
H,;=0, (6.6.31) 


and thus also B = 47M in the interior. From the continuity conditions it 
follows that B = H = 0 outside the sample. The domain configuration is 
therefore energetically more favorable than a uniformly magnetized sample. 
(ii) Domain structures also express themselves in a measurement of the to- 
tal magnetic moment M of a sphere. The calculated magnetization M = Mu 
as a function of the applied field is indicated by the curves in Fig. 6.15. 


Fig. 6.14. The domain structure in a prism-shaped sample 
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M M= H,/D 
Ti < Te 
To < T 
Fig. 6.15. The magnetization within a 
sphere as a function of the external field 
A,,T, < Tz < Tc; D is the demagnetizing 
Ha factor. 


Let the magnetization within a uniformly magnetized region as a function 
of the internal field H; = H, — DM be given by the function M = M(H;). 
As long as the overall magnetization of the sphere is less than the satura- 
tion magnetization, the domains have a structure such that H; = 0, and 
therefore, M = tHa must hold.*® For Ha = DMgpont, the sample is finally 
uniformly magnetized, corresponding to the saturation magnetization. For 
Ha > DMspont, M can be calculated from M = M(H, — DM). 


6.7 Applications to Related Phenomena 


In this section, we discuss consequences of the results of this chapter on 
magnetism for other areas of physics: polymer physics, negative temperatures 
and the melting curve of *He. 


6.7.1 Polymers and Rubber-like Elasticity 


Polymers are long chain molecules which are built up of similar links, the 
monomers. The number of monomers is typically N ~ 100,000. Examples 
of polymers are polyethylene, (CH,),,, polystyrene, (CgHg),, and rubber, 
(C5Hs) y, where the number of monomers is N > 100,000 (see Fig. 6.16). 


polethylene: polystyrene: 


H H 
—CH,— CH — CH,—CH 


4 
L oh 4 
—c—c—c—.... 
| | ft 
H H H 


Fig. 6.16. The structures of polyethylene and polystyrene 


38'S. Arajs and R. V Calvin, J. Appl. Phys. 35, 2424 (1964). 
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To find a description of the mechanical and thermal properties we set up the 
following simple model (see Fig. 6.17): the starting point in space of monomer 
1 is denoted by Xj, and that of a general monomer 7 by X;. The position 
(orientation) of the ith monomer is then given by the vector S; = Xj41 — Xj: 


SX hs S) SK KS Ke Re ee ET) 


We now assume that aside from the chain linkage of the monomers there are 
no interactions at all between them, and that they can freely assume any 
arbitrary orientation, ie. < S;-S; >=0 for i #7. The length of a monomer 
is denoted by a, ie. S? =a? . 


Fig. 6.17. A polymer, composed of a chain of monomers 


Since the line connecting the two ends of the polymer can be represented in 
the form 


N 
Xvi -Xi=S0S;, (6.7.2) 
w=1 


it follows that 

(Xn41— Xi) =0. (6.7.3) 
Here, we average independently over all possible orientations of the S;. The 
last equation means that the coiled polymer chain is oriented randomly in 


space, but makes no statement about its typical dimensions. A suitable mea- 
sure of the mean square length is 


((Xw41-X1)’) = Os s;)’) =aN. (6.7.4) 


We define the so called radius of gyration 


R= vcore = Xi)) =aN?, (6.7.5) 


which characterizes the size of the polymer coil that grows as the square root 
of the number of monomers. 
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In order to study the elastic properties, we allow a force to act on the 
ends of the polymer, i.e. the force F acts on X y+, and the force —F on Xj 
(see Fig. 6.17). Under the influence of this tensile force, the energy depends 
on the positions of the two ends: 


ae CO ees 
=— (Xvi —Xy) + (Xn —Xy-1) Sa Na (Xo —X,)] -F 


N 
=-F-)°S;. (6.7.6) 
t=1 


Polymers under tension can therefore be mapped onto the problem of a para- 
magnet in a magnetic field, Sect. 6.3. The force corresponds to the applied 
magnetic field in the paramagnetic case, and the length of the polymer chain 
to the magnetization. Thus, the thermal average of the distance vector be- 
tween the ends of the chain is 


N 
aF kT\F 
L= iyo Nal coth == = \—. 6.7.7 
(os) a (co kT =) ¥ (eee?) 


We have used the Langevin function for classical moments in this expression, 
Eq. (6.3.12b), and multiplied by the unit vector in the direction of the force, 
F/F. If aF is small compared to kT, we find (corresponding to Curie’s law) 


Na? 

L= 3ere - (6.7.8) 
For the change in the length, we obtain from the previous equation 

OL 1 

paae ytd! 6.7.9 

OF T lee 
and 

OL Na? 


The length change per unit force or the elastic constant decreases with in- 
creasing temperature according to (6.7.9a). A still more spectacular result is 
that for the expansion coefficient oh rubber contracts when its temperature 
is increased! This is in complete contrast to crystals, which as a rule expand 
with increasing temperature. The reason for the elastic behavior of rubber is 
easy to see: the higher the temperature, the more dominant is the entropy 
term in the free energy, F = E — TS, which strives towards a minimum. 
The entropy increases, i.e. the polymer becomes increasingly disordered or 
coiled and therefore pulls together. The general dependence of the length on 
a\F|/kT is shown in Fig. 6.18. 
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Fig. 6.18. The length of a poly- 
mer under the influence of a tensile 
a|F|kT force F. 
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Remark: In the model considered here, we have not taken into account 
that a monomer has a limited freedom of orientation, since each position 
can be occupied by at most one monomer. In a theory which takes this 
effect into account, the dependence R = aN1/? in Eq. (6.7.5) is replaced 
by R = aN”. The exponent v has a significance analogous to that of the 
exponent of the correlation length in phase transitions, and the degree of 
polymerization (chain length) N corresponds to the reciprocal distance from 
the critical point, t~'. The properties of polymers, in which the volume 
already occupied is excluded, correspond to a random motion in which the 
path cannot lead to a point already passed through (self-avoiding random 
walk). The properties of both these phenomena follow from the n-component 
¢* model (see Sect. 7.4.5) in the limit n — 0.°9 An approximate formula for 
v is due to Flory: vpiory = 3/(d + 2). 


6.7.2 Negative Temperatures 


In isolated systems whose energy levels are bounded above and below, ther- 
modynamic states with negative absolute temperatures can be established. 
Examples of such systems with energy levels that are bounded towards higher 
energies are two-level systems or paramagnets in an external magnetic field 
h. 

We consider a paramagnet consisting of N spins of quantum number 
S = 1/2 with an applied field along the z direction. Considering the quan- 


tum numbers of the Pauli spin matrices 0; = +1, the Hamiltonian has the 
following diagonal structure 
H=-h> or. (6.7.10) 
1 


The magnetization per lattice site is defined by m = (c) and is independent 
of the lattice position 1. The entropy is given by 


39 P_-G. de Gennes, Scaling Concepts in Polymer Physics, Cornell University Press, 
Ithaca, 1979. 


6.7 Applications to Related Phenomena 321 


1 1 1— 1- 
S(m) = -—kN es ual log us 

2 2 2 2 

N N (6.7.11) 
= —k|N, log —+ + N_ log — 
| +108 7, + 8 Hy , 
and the internal energy E depends on the magnetization via 

E=—Nhm=-—h(Ni-—N_), (6.7.12) 


with Nz = N(1 +4 m)/2. These expressions follow immediately from the 
treatment in the microcanonical ensemble (Sect. 2.5.2.2) and can also be 
obtained from Sect. 6.3 by elimination of T and B. For m = 1 (all spins 
parallel to the field h), the energy is E = —Nh; for m = —1 (all spins 
antiparallel to h), the energy is E = Nh. The entropy is given in Fig. 2.9 as 
a function of the energy. It is maximal for F' = 0, i.e. in the state of complete 
disorder. The temperature is obtained by taking the derivative of the entropy 
with respect to the energy: 


T= 


2h 1 = 
= c =m) (6.7.13) 


1 

(3B) p k “T= 
It is shown as a function of the energy in Fig. 2.10. In the interval 0 < m <1, 
ie. -—1 < E/Nh < 0, the temperature is positive, as usual. For m < 0, 
that is when the magnetization is oriented antiparallel to the magnetic field, 
the absolute temperature becomes negative, i.e. JT < 0! With increasing 
energy, the temperature T goes from 0 to oo, then through —oo, and finally to 
—0. Negative temperatures thus belong to higher energies, and are therefore 
“hotter” than positive temperatures. In a state with a negative temperature, 
more spins are in the excited state than in the ground state. One can also 
see that negative temperatures are in fact hotter than positive by bringing 
two such systems into thermal contact. Take system 1 to have the positive 
temperature T,; > 0 and system 2 the negative temperature T2 < 0. We 
assume that the exchange of energy takes place quasistatically; then the total 
entropy is S = S$;(£1) + So(£2) and the (constant) total energy is EF = 


FE, + E,. From the increase of entropy, it follows with ihe = — that 
dS OS, dE, OS2 dE» 1 1 \ dE; 
0 = = 7.14 
SG OE, dt | OE, dt  \T Th) d on 


Since the factor in brackets, (2 + wa) is positive, “41 > 0 must also hold; 


this means that energy flows from subsystem 2 at a negative temperature 
into subsystem 1. 

We emphasize that the energy dependence of S(£) represented in Fig. 2.9 
and the negative temperatures which result from it are a direct consequence 
of the boundedness of the energy levels. If the energy levels were not bounded 
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from above, then a finite energy input could not lead to an infinite tempera- 
ture or even beyond it. We also note that the specific heat per lattice site of 
this spin system is given by 


C 2h \? e2h/kT 
= 7 (6.7.15) 
Nk kT (1 + e2h/kT) 


and vanishes both at T = +0 as well as at T= too. 
We now discuss two examples of negative temperatures: 


(i) Nuclear spins in a magnetic field: 

The first experiment of this kind was carried out by Purcell and Pound*? 
in a nuclear magnetic resonance experiment using the nuclear spins of “Li 
in LiF. The spins were first oriented at the temperature T’ by the field H. 
Then the direction of H was so quickly reversed that the nuclear spins could 
not follow it, that is faster than a period of the nuclear spin precession. The 
spins are then in a state with the negative temperature —T. The mutual 
interaction of the spins is characterized by their spin-spin relaxation time of 
10-°—10~§ sec. This interaction is important, since it allows the spin system 
to reach internal equilibrium; it is however negligible for the energy levels in 
comparison to the Zeeman energy. For nuclear spins, the interaction with the 
lattice in this material is so slow (the spin-lattice relaxation time is 1 to 10 
min) that the spin system can be regarded as completely isolated for times 
in the range of seconds. The state of negative temperature is maintained for 
some minutes, until the magnetization reverses through interactions with the 
lattice and the temperature returns to its initial value of T. In dilute gases, 
a state of spin inversion with a lifetime of days can be established. 

(ii) Lasers (pulsed lasers, ruby lasers): 

By means of irradiation with light, the atoms of the laser medium are excited 
(Fig. 6.19). The excited electron drops into a metastable state. When more 


level 


tl 


T<0 
(a) (b) 


{ Tae iit f ‘aie metastable 
¥ 


Fig. 6.19. Examples of negative temperatures: (a) nuclear spins in a magnetic field 
H, which is rotated by 180° (b) a ruby laser. The “pump” raises electrons into an 
excited state. The electron can fall into a metastable state by emission of a photon. 
When a population inversion is established, the temperature is negative 


40 &.M. Purcell and R. V. Pound, Phys. Rev. 81, 279 (1951) 
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electrons are in this excited state than in the ground state, i.e. when a pop- 
ulation inversion has been established, the state is described by a negative 
temperature. 


*6.7.3 The Melting Curve of 7He 


The anomalous behavior of the melting curve of *He (Fig. 6.20) is related to 
the magnetic properties of solid *He.*! As we already discussed in connection 


34 


3.0 


23) 3 a) a a Fig. 6.20. The melting curve 
T (mK) of “He at low temperatures 


with the Clausius—Clapeyron equation, 
aP _ S5~ 5 
aT Vg-Vy 


the solid has a higher entropy than the liquid in the temperature range below 
0.32 K. The minimum in the melting curve occurs according to the Clausius— 
Clapeyron equation just when the entropies are equal. The magnetic effects 
in °He result from the nuclear spins and are therefore considerably weaker 
than in electronic magnetism. The exchange interaction J is so small that 
the antiferromagnetic order in solid 3He sets in only at temperatures of 
Tn ~ 1073K. Hence, the spins are disordered and make a contribution to 
the entropy (cf. (6.3.16’)) of: 


$5 = Ne|og2-0(() )] (6.7.17) 


41 J. Wilks, The properties of Liquid and Solid Helium, Clarendon Press, Oxford, 
1967; C. M. Varma and N.,R. Werthamer, in K.-H. Bennemann and J. B. Ketter- 
son, Eds., The physics of liquid and solid He Part I, p. 549, J. Wiley, New York, 
1976; A.C. Anderson, W. Reese, J.C. Wheatley, Phys. Rev. 130, 495 (1963); 
O.V. Lounasmaa, Experimental Principles and Methods Below 1K, Academic 
Press, London, 1974. 


20; (6.7.16) 
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Since the lattice entropy at 0.3K is negligibly small, this is practically the 
entire entropy in the solid. The entropy of the liquid is (from Eq. (4.3.19)) 


SpxkN—— Tpeik. (6.7.18) 


According to the Clausius—Clapeyron equation (6.7.16), the melting curve 
has a minimum when Sy, = Ss: 


2T 2T PF 

Below Tyin, the slope of the melting curve ue = ante < 0, since there, 
Si < Sg and V, > Vg. This leads to the Pomeranchuk effect, already men- 
tioned in Sect. 3.8.2. The above estimate of Tin yields a value which is a 


factor of 2 smaller than the experimental result, T°. = 0.3K. This results 


min 

from the value of Si, which is too large. Compared to an ideal gas, there 
are correlations in an interacting Fermi liquid which, as can be understood 
intuitively, lead to a lowering of its entropy and to a larger value of Tyin. 

Before the discovery of the two superfluid phases of *He, the existence of a 
maximum in the melting curve below 10~°K was theoretically discussed.*! It was 
expected due to the T?-dependence of the specific heat in the antiferromagnetically 
ordered phase and the linear specific heat of the Fermi liquid. This picture however 
changed with the discovery of the superfluid phases of *He (see Fig. 4.10). The 
specific heat of the liquid behaves at low temperatures like e~4/*" , with a constant 
A (energy gap), and therefore the melting curve rises for T — 0 and has the slope 0 
at T = 0. 
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Problems for Chapter 6 


6.1 Derive (6.1.24c) for the Hamiltonian of (6.1.25), by taking the second derivative 
of 


A(T, M) = —kT log Tre?" + HM 
with respect to T for fixed M. 


6.2 The classical paramagnet: Consider a system of N non-interacting, classical 
magnetic moments, p; (\/u#? = m) in a magnetic field H, with the Hamiltonian 
H=- ae u;H. Calculate the classical partition function, the free energy, the 


entropy, the magnetization, and the isothermal susceptibility. Refer to the sugges- 
tions following Eq. (6.3.12b). 


6.3 The quantum-mechanical paramagnet, in analogy to the main text: 
(a) Calculate the entropy and the internal energy of an ideal paramagnet as a 
function of T. Show that for T — oo, 


S=NkIn(2J +1) , 


and discuss the temperature dependence in the vicinity of T = 0. 
(b) Compute the heat capacities Cy and Cm for a non-interacting spin-1/2 system. 


6.4 The susceptibility and mean square deviation of harmonic oscillators: 
Consider a quantum-mechanical harmonic oscillator with a charge e in an electric 
field E 


Show that the dielectric susceptibility is given by 
_ (ex) e 


dE mu 
and that the mean square deviation takes the form 


(x) = 5 coth MY : 


2wm 


from which it follows that 


2tanh 2 2 
fae 


Compare these results with the paramagnetism of non-coupled magnetic moments! 
Take account of the difference between rigid moments and induced moments, and 
the resulting different temperature dependences of the susceptibility. Take the clas- 
sical limit Ghw < 1. 


326 6. Magnetism 


6.5 Consider a solid with N degrees of freedom, which are each characterized by 
two energy levels at A and —A. Show that 


A dE AN? oh 
E =—NAtanh— = =Nk 
a OO ap (=) cosh? 


holds. How does the specific heat behave for T >> A/k and for T < A/k? 


6.6 When the system described in 6.5 is disordered, so that all values of A within 
the interval 0 < A < Apo occur with equal probabilities, show that then the specific 
heat for kT <« Ap is proportional to T. 

Hint: The internal energy of this system can be found from problem 6.5 by 
averaging over all values of A. This serves as a model for the linear specific heat of 
glasses at low temperatures. 


6.7 Demonstrate the validity of the fluctuation-response theorem, Eq. (6.5.35). 


6.8 Two defects are introduced into a ferromagnet at the sites x; and x2, and 
produce there the magnetic fields hi and hz. Calculate the interaction energy of 
these defects for |x1 — x2| > €. For which signs of the h; is there an attractive 
interaction of the defects? 

Suggestions: The energy in the molecular field approximation is 


B= Vu (S1) (Sy) JU). 

For each individual defect, (5;),, = G(x: — X1,2) h1,2, where G is the Ornstein- 
Zernike correlation function. For two defects which are at a considerable distance 
apart, (S,) can be approximated as a linear superposition of the single-defect av- 
erages. The interaction energy can be obtained by calculating FE for this linear 
superposition and subtracting the energies of the single defects. 


6.9 The one-dimensional Ising model: Calculate the partition function Zy for a 
one-dimensional Ising model with N spins obeying the Hamiltonian 


N-1 
Ha WS Sects 
i=l 


Hint: Prove the recursion relation Zy+1 = 2Zy cosh (Jn /kT). 


6.10 (a) Calculate the two-spin correlation function Gin := (5S; Sitn) for the one- 
dimensional Ising model in problem 6.9. 

Hint: The correlation function can be found by taking the appropriate derivatives 
of the partition function with respect to the interactions. Observe that $? = 1. 
Result: Gin = tanh” (J/kT) for J; = J. 

(b) Determine the behavior of the correlation length defined by Gin = e~"/* for 
T-0. 

(c) Calculate the susceptibility from the fluctuation-response theorem: 


x= (sus) Dt). 


Hint: Consider how many terms with |i — j| = 0, |i — j| =1, |i — j| = 2 etc. occur 
in the double sum. Compute the geometric series which appear. 
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Result: 


x= pa fy (Ba) Palme an 


kT l-a (1a) kT 


(d) Show that in the thermodynamic limit, (N — 00) x « € for T — 0, and thus 


y/v=1. 

(e) Plot y~! in the thermodynamic limit as a function of temperature. 

(f) How can one obtain from this the susceptibility of an antiferromagnetically- 
coupled linear chain? Plot and discuss x as a function of temperature. 


6.11 Show that in the molecular-field approximation for the Ising model, the in- 
ternal energy F is given by 


E= Guz m — hn) N 
and the entropy S by 


Te. . 
~m ppm log(2 cosh( Tian + )/KT)| ; 


s=kn| 


Inserting the equation of state, show also that 


l+m l+m ,l—=m l—m 
Ss kw (+ tog +4 5 tog =") 


Finally, expand a(T,m) = e — Ts + mh up to the 4th power in m. 


6.12 An improvement of the molecular field theory for an Ising spin system can 
be obtained as follows (Bethe—Peierls approximation): the interaction of a spin oo 
with its z neighbors is treated exactly. The remaining interactions are taken into 
account by means of a molecular field h’, which acts only on the z neighbors. The 
Hamiltonian is then given by: 


H = ABS, = iS Ga: —hoo. 
j=l j=l 


The applied field h acts directly on the central spin and is likewise included in h’. 
H' is determined self-consistently from the condition (a0) = (aj). 
(a) Show that the partition function Z (h’,T) has the form 


(b) Calculate the average values (9) and (oj) for simplicity with h = 0. 
Result: 


I 
| 
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» 
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(c) The equation (00) = (oj) has a nonzero solution below T-: 


ne 1 cosh (Ge + Be) 
= 5 log 7 
RT(@—1) 2°” cosh (ep — er) 


Determine T; and h’ by expanding the equation in terms of coe 
Result: 


tanh 2 =1/(z-1) 
h’ \? _. cosh? (J/kT) ej av 
(Zz) 3 sinh (J/RT) {tant kT z—1' =) 


6.13 In the so called Weiss model, each of the N spins interacts equally strongly 
with every other spin: 


H = —5 So Joov ~hSoo . 


Ll l 


Here, J = we, This model can be solved exactly; show that it yields the result of 
molecular field theory. 


6.14 Magnons (= spin waves) in ferromagnets. The Heisenberg Hamiltonian, which 
gives a satisfactory description of certain ferromagnets, is given by 


1 
H = —5 I (lx —xy|)SiSy , 


1 


where | and I’ are nearest neighbors on a cubic lattice. By applying the Holstein 
Primakoff transformation, 


St =V28y(m)a, Sp =V28aty(m), S7=S—m 


(Si = Spt iS/) with y(n) = f/1— 11/25, ni = ala and [ar al,] = dy, as well 
as fai, ay] = 0 — the spin operators are transformed into Bose operators. 

(a) Show that the commutation relations for the spin operators are fulfilled. 

(b) Represent the Heisenberg Hamiltonian up to second order (harmonic approx- 
imation) in the Bose operators a; by expanding the square roots in the above 
transformation in a Taylor series. 

(c) Diagonalize H (by means of a Fourier transformation) and determine the 
magnon dispersion relations. 


6.15 (a) Show that a magnon lowers the z-component of the total spin operator 
S*=S° Sf by h. 


1 
(b) Calculate the temperature dependence of the magnetization. 

(c) Show that in a one- and a two-dimensional spin lattice, there can be no ferro- 
magnetic order at finite temperatures! 
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6.16 Assume a Heisenberg model in an external field H, 


1 
H=-5 >) J(l-U) 8S) —p-H, 


Ll’ 


L 


Show that the isothermal susceptibilities |; (parallel to H) and x1 (perpendicular 
to H) are not negative. 

Suggestions: Include an additional field AH in the Hamiltonian and take the deriva- 
tive with respect to this field. For x), ic. AH ||H, the assertion follows as in 
Sect. 3.3 for the compressibility. For an arbitrarily oriented AH, it is expedient to 
use the expansion given in Appendix C. 


6.17 Denote the specific heat at constant magnetization by cv, and at constant 
field by cz. Show that relation (6.1.22c) holds for the isothermal and the adiabatic 
susceptibility. Volume changes of the magnetic material are to be neglected here. 


6.18 A paramagnetic material obeys the Curie law 
M= as ; 
T 


where c is a constant. Show, keeping in mind T dS = dE — HdM, that 


dTad = 


val 
cdH 
cHT 
for an adiabatic change (keeping the volume constant). cy is the specific heat at 
constant magnetic field. 


6.19 A paramagnetic substance obeys the Curie law M = £H (c const.) and its 
internal energy E is given by E = aT“ (a > 0, const.). 

(a) What quantity of heat 6Q is released on isothermal magnetization if the mag- 
netic field is increased from 0 to Hy? 

(b) How does the temperature change if the field is now reduced adiabatically from 
HA, to 0? 


6.20 Prove the relationships between the shape-dependent and the shape-in- 
dependent specific heat (6.6.11a), (6.6.11b) and (6.6.11c). 


6.21 Polymers in a restricted geometry: Consider a polymer which is in a cone- 
shaped box (as shown). Why does the polymer move towards the larger opening? 
(no calculation necessary!) 


=> 


7. Phase Transitions, Scale Invariance, 
Renormalization Group Theory, and 
Percolation 


This chapter builds upon the results of the two preceding chapters deal- 
ing with the ferromagnetic phase transition and the gas-liquid transition. We 
start with some general considerations on symmetry breaking and phase tran- 
sitions. Then a variety of phase transitions and critical points are discussed, 
and analogous behavior is pointed out. Subsequently, we deal in detail with 
critical behavior and give its phenomenological description in terms of static 
scaling theory. In the section that follows, we discuss the essential ideas of 
renormalization group theory on the basis of a simple model, and use it to 
derive the scaling laws. Finally, we introduce the Ginzburg-Landau theory; 
it provides an important cornerstone for the various approximation methods 
in the theory of critical phenomena. 

The first, introductory section of this chapter exhibits the richness and 
variety of phase-transition phenomena and tries to convey the fascination of 
this field to the reader. It represents a departure from the main thrust of this 
book, since it offers only phenomenological descriptions without statistical, 
theoretical treatment. All of these manifold phenomena connected with phase 
transitions can be described by a single unified theory, the renormalization 
group theory, whose theoretical efficacy is so great that it is also fundamental 
to the quantum field theory of elementary particles. 


7.1 Phase Transitions and Critical Phenomena 


7.1.1 Symmetry Breaking, the Ehrenfest Classification 


The fundamental laws of Nature governing the properties of matter 
(Maxwell’s electrodynamics, the Schrodinger equation of a many-body sys- 
tem) exhibit a number of distinct symmetry properties. They are invariant 
with respect to spatial and temporal translations, with respect to rotations 
and inversions. The states which exist in Nature do not, in general, display 
the full symmetry of the underlying natural principles. A solid is invariant 
only with respect to the discrete translations and rotations of its point group. 

Matter can furthermore exist in different states of aggregation or phases, 
which differ in their symmetry and as a result in their thermal, mechanical, 
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and electromagnetic properties. The external conditions (pressure P, tem- 
perature T, magnetic field H, electric field E, ...) determine in which of the 
possible phases a chemical substance with particular internal interactions will 
present itself. If the external forces or the temperature are changed, at par- 
ticular values of these quantities the system can undergo a transition from 
one phase to another: a phase transition takes place. 

The Ehrenfest Classification: as is clear from the examples of phase transi- 
tions already treated, the free energy (or some other suitable thermodynamic 
potential) is a non-analytic function of a control parameter at the phase tran- 
sition. The following classification of phase transitions, due to Ehrenfest, is 
commonly used: a phase transition of n-th order is defined by the property 
that at least one of the n-th derivatives of its thermodynamic potential is 
discontinuous, while all the lower derivatives are continuous at the transi- 
tion. When one of the first derivatives shows a discontinuity, we speak of a 
first-order phase transition; when the first derivatives vary continuously but 
the second derivatives exhibit discontinuities or singularities, we speak of a 
second-order phase transition (or critical point), or of a continuous phase 
transition. 

The understanding of the question as to which phases will be adopted 
by a particular material under particular conditions certainly belongs among 
the most interesting topics of the physics of condensed matter. Due to the 
differing properties of different phases, this question is also of importance for 
materials applications. Furthermore, the behavior of matter in the vicinity of 
phase transitions is also of fundamental interest. Here, we wish to indicate 
two aspects in particular: why is it that despite the short range of the interac- 
tions, one observes long-range correlations of the fluctuations, in the vicinity 
of a critical point T. and long-range order below T..? And secondly, what is 
the influence of the internal symmetry of the order parameter? Fundamental 
questions of this type are of importance far beyond the field of condensed- 
matter physics. Renormalization group theory was originally developed in the 
framework of quantum field theory. In connection with critical phenomena, 
it was formulated by Wilson! in such a way that the underlying structure 
of nonlinear field theories became apparent, and that also allowed system- 
atic and detailed calculations. This decisive breakthrough led not only to an 
enormous increase in the knowledge and deeper understanding of condensed 
matter, but also had important repercussions for the quantum-field theo- 
retical applications of renormalization group theory in elementary particle 
physics. 


*7.1.2 Examples of Phase Transitions and Analogies 


We begin by describing the essential features of phase transitions, referring 
to Chaps. 5 and 6, where the analogy and the common features between 


1 K.G. Wilson, Phys. Rev. B 4, 3174, 3184 (1971) 
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melting 
ferromagnetic 


iD 


paramagnetic 


Te 


' 
Vapor |} 
pressure: 
curve! 


' 
gaseous! 


sublimation curve 
(a) (b) 


Fig. 7.1a,b. Phase diagrams of (a) a liquid (P-T’) and (b) a ferromagnet (H-T). 
(Triple point = T.P., critical point = C.P.) 


the liquid-gas transition and the ferromagnetic phase transition were already 
mentioned, and here we take up their analysis. In Fig. 7.1la,b, the phase 
diagrams of a liquid and a ferromagnet are shown. The two ferromagnetic 
ordering possibilities for an Ising ferromagnet (spin “up” and spin “down” ) 
correspond to the liquid and the gaseous phases. The critical point corre- 
sponds to the Curie temperature. As a result of the symmetry of the Hamil- 
tonian for H = 0 with respect to the operation 0; — —o; for all 1, the phase 
boundary is situated symmetrically in the H-T plane. 

Ferromagnetic order is characterized by the order parameter m at H = 0. 
It is zero above T, and + mo below T,, as shown in the M-T diagram in 
Fig. 7.1d. The corresponding quantity for the liquid can be seen in the V-T 
diagram of Fig. 7.1c. 

Here, the order parameter is (pz — pc) or (pq — fc). In everyday life, 
we usually observe the liquid-gas transition at constant pressure far below 
P.. On heating, the density changes discontinuously as a function of the 
temperature. Therefore, the vaporization transition is usually considered to 
be a first-order phase transition and the critical point is the end point of the 
vaporization curve, at which the difference between the gas and the liquid 
ceases to exist. The analogy between the gas-liquid and the ferromagnetic 
transitions becomes clearer if one investigates the liquid in a so called Natterer 
tube?. This is a sealed tube in which the substance thus has a fixed, given 
density. If one chooses the amount of material so that the density is equal 
to the critical density p., then above T, there is a fluid phase, while on 
cooling, this phase splits up into a denser liquid phase separated from the 
less dense gas phase by a meniscus. This corresponds to cooling a ferromagnet 


2 See the reference in Sect. 3.8. 


334 7. Phase Transitions, Renormalization Group Theory, and Percolation 


coexistence 
curve 
« 


Ve V 


Pc 


(c) 


(d) 


Tele TTe 


Fig. 7.1c,d. The order parameter for (c) the gas-liquid transition (below, two 
Natterer tubes are illustrated), and for (d) the ferromagnetic transition 


at H = 0. Above T;, the disordered paramagnetic state is present, while below 
it, the sample splits up into (at least two) negatively and positively oriented 
ferromagnetic phases.® Fig. 7.1e,f shows the isotherms in the P-V and M-H 
diagrams. The similarity of the isotherms becomes clear if the second picture 
is rotated by 90°. In ferromagnets, the particular symmetry again expresses 
itself. Since the phase boundary curve in the P-T diagram of the liquid is 
slanted, the horizontal sections of the isotherms in the P-V diagram are not 
congruent. Finally, Fig. 7.1g,h illustrates the surface of the equation of state. 

The behavior in the immediate vicinity of a critical point is characterized 
by power laws with critical exponents which are summarized for ferromag- 
nets and liquids in Table 7.1. As in Chaps. 5 and 6, T = Efe The critical 
exponents 3,y7,6,a for the order parameter, the susceptibility, the critical 
isotherm, and the specific heat are the goal of theory and experiment. Addi- 
tional analogies will be seen later in connection with the correlation functions 
and the scattering phenomena which follow from them. 


3 In Ising systems there are two magnetization directions; in Heisenberg systems 
without an applied field, the magnetization can be oriented in any arbitrary 
direction, since the Hamiltonian (6.5.2) is rotationally invariant. 
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coexistence _ 
curve 


Fig. 7.1g,h. The surface of the equation of state for a liquid (g) and for a ferro- 
magnet (h) 


The general definition of the value of a critical exponent of a function f (T — T-), 
which is not a priori a pure power law is given by 


exponent = lim dlog il = Te) ‘ 
TT. d log(T — Te) 


When f has the form f = a+ (T — T,), one finds: 


dlogja+T—T.) _ 1 1 _ peed bie 0 
d log(T — T.) a+(T-—T.) ee a+(T—T.) : 


(7.1.1) 


When f is logarithmically divergent, the following expression holds: 


d log log (T' — T-) 1 
= 0. 
d log(T — T-) log(T — Te) 
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In these two cases, the value of the critical exponent is zero. The first case occurs 
for the specific heat in the molecular field approximation, the second for the specific 
heat of the two-dimensional Ising model. The reason for introducing critical expo- 
nents even for such cases can be seen from the scaling laws which will be treated 
in the next section. To distinguish between the different meanings of the exponent 
zero (discontinuity and logarithm), one can write 0q and Ojog. 


Table 7.1. Ferromagnets and Liquids: Critical Exponents 


Ferromagnet Liquid Critical behavior 

(Va,t — Ve) 

Order M ae (7) T<T, 

parameter 
(pa,L — pe) 

Isothermal Magnetic Isothermal 

susceptibil- susceptibility compressibility x |r|77 T2T. 

: _ (2a _ a 

ity xr =(Sr)r kr =v (3p) + 

Critical ~ Me 

isotherm H = H(M) P=P(V-\) Vv.) PHL 

(T =T.) ~(V—Ve) 

Specific Cu=0 = Cx=0 as = 

heat =7(28),  W=Tary tr T2T 


We want to list just a few examples from among the multitude of phase 
transitions*. In the area of magnetic substances, one finds antiferromagnets 
(e.g. with two sublattices having opposite directions of magnetization My, 
and Mz), ferrimagnets, and helical phases. In an antiferromagnet with two 
sublattices, the order parameter is N = M, — Mg, the so called staggered 
magnetization. In binary liquid mixtures, there are separation transitions, 
where the order parameter characterizes the concentration. In the case of 
structural phase transitions, the lattice structure changes at the transition, 
and the order parameter is given by the displacement field or the strain ten- 
sor. Examples are ferroelectrics’ and distortive transitions, where the order 
parameter is given by e.g. the electric polarization P or the rotation angle y 
of a molecular group. Finally, there are transitions into macroscopic quantum 
states, i.e. superfluidity and superconductivity. Here, the order parameter is 
a complex field ~, the macroscopic wavefunction, and the broken symmetry 
is the gauge invariance with respect to the phase of w. In the liquid-solid 
transition, the translational symmetry is broken and the order parameter is 


4 We mention two review articles in which the literature up to 1966 is summa- 
rized: M.E. Fisher, The Theory of Equilibrium Critical Phenomena, p. 615; and 
P. Heller, Experimental Investigations of Critical Phenomena, p. 731, both in 
Reports on Progress in Physics XXX (1967). 

> In a number of structural phase transitions, the order parameter jumps discon- 
tinuously to a finite value at the transition temperature. In this case, according 
to Ehrenfest’s classification, we are dealing with a first-order phase transition. 
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a component of the Fourier-transformed density. This transition line does not 
end in a critical point. 

Table 7.2 lists the order parameter and an example of a typical substance 
for some of these phase transitions. 


Table 7.2. Phase transitions (critical points), order parameters, and substances 


Phase transition Order parameter Substance 


Paramagnet—ferromagnet Magnetization M Fe 
(Curie temperature) 


Paramagnet—antiferromagnet staggered N=Mi-M2 RbMnF3 

(Néel temperature) magnetization 

Gas-liquid Density P— Pe CO2 

(Critical point) 

Separation of binary Concentration c— Ce Methanol- 

liquid mixtures n-Hexane 

Order-—disorder Sublattice Na — Neg Cu-Zn 

transitions occupation 

Paraelectric-ferroelectric Polarization P BaTiO3 

Distortive structural Rotation angle SrTiO3 

transitions 

Elastic phase transitions Strain € KCN 

He I-He II (Lambda point) Bose condensate Ww ‘He 

Normal conductor—superconductor Cooper-pair A Nb3Sn 
amplitude 


In general, the order parameter is understood to be a quantity which is 
zero above the critical point and finite below it, and which characterizes the 
structural or other changes which occur in the transition, such as the expec- 
tation value of lattice displacements or a component of the total magnetic 
moment. 

To clarify some concepts, we discuss at this point a generalized anisotropic, 
ferromagnetic Heisenberg model: 


1 Zee Lx z 
KH — 75 » { Jy (i = l\o; OV + Ji (l = V') (a; OV + BN le 5 (7.1.2) 


where o; = (07, 0/,07) is the three-dimensional Pauli spin operator at lattice 


site x; and N is the number of lattice sites. This Hamiltonian contains the uni- 
azial ferromagnet for J\(l—I) > Ji (I—l’) = 0, and for J, (1-1) = J(U—-V), 
it describes the isotropic Heisenberg model (6.5.2). In the former case, the 
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order parameter referred to the number of lattice sites (h = 0) is the single- 
component quantity (w 1 oF ys i.e. the number of components n is n = 1. 
In the latter case, the order parameter is (x oe 1), which can point in any 
arbitrary direction (h = 0!); here, the number of components is n = 3. For 
Ji (l—V) > J\(l—U) = 0, we find the so called planar ferromagnet, in which 
the order parameter + >>, ((o7,0//,0)) has two components, n = 2. A special 
case of the uniaxial ferromagnet is the Ising model (6.5.4), with J, (J—I’) = 0. 
The uniaxial ferromagnet has the following symmetry elements: all rotations 
around the z-axis, the discrete symmetry (o/,0/,07) — (o/,07,—o7) and 
products thereof. Below T,, the invariance with respect to this discrete sym- 
metry is broken. In the planar ferromagnet, the (continuous) rotational sym- 
metry around the z-axis, and in the case of the isotropic Heisenberg model, 
the O(3) symmetry — i.e. the rotational invariance around an arbitrary axis 
— is broken. 

One could ask why e.g. for the the Ising Hamiltonian without an exter- 
nal field, (>> 01) can ever be nonzero, since from the invariance operation 
{o7} — {—-o7}, it follows that +(0,07) = —# (>), 07). In a finite system, 
x (>> 07), is analytic in h for finite h, and 


lim > OR ae (7.1.3) 


For finite N, configurations with spins oriented opposite to the field also con- 
tribute to the partition function, and their weight increases with decreasing 
values of h. 
The mathematically precise definition of the order parameter is: 
1 
See i x ( i) ; 14 
i?) ho Ned di ong) 
h 
first, the thermodynamic limit N — oo is taken, and then h — 0. This 
quantity can be nonzero below T,. For N — oo, states with the ‘wrong’ 


orientation have vanishing weights in the partition function for arbitrarily 
small but finite fields. 


7.1.3 Universality 


In the vicinity of critical points, the topology of the phase diagrams of such di- 
verse systems as a gas-liquid mixture and a ferromagnetic material are aston- 
ishingly similar; see Fig. 7.1. Furthermore, experiments and computer simula- 
tions show that the critical exponents for the corresponding phase transitions 
for broad classes of physical systems are the same and depend only on the 
number of components and the symmetry of the order parameter, the spatial 
dimension and the character of the interactions, i.e. whether short-ranged, or 
long-ranged (e.g. Coulomb, dipolar forces). This remarkable feature is termed 
universality. The microscopic details of these strongly interacting many-body 
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systems express themselves only in the prefactors (amplitudes) of the power 
laws, and even the ratios of these amplitudes are universal numbers. 
The reason for this remarkable result lies in the divergence of the cor- 


relation length € = fo( Spt) . On approaching T., € becomes the only 


relevant length scale of the system, which at long distances dominates all 
of the microscopic scales. Although the phase transition is caused as a rule 
by short-range interactions of the microscopic constituents, due to the long- 
range fluctuations (see 6.12), the dependence on the microscopic details such 
as the lattice structure, the lattice constant, or the range of the interactions 
(as long as they are short-ranged) is secondary. In the critical region, the 
system behaves collectively, and only global features such as its spatial di- 
mension and its symmetry play a role; this makes the universal behavior 
understandable. 

The universality of critical phenomena is not limited to materials classes, 
but instead it extends beyond them. For example, the static critical behavior 
of the gas-liquid transition is the same as that of Ising ferromagnets. Planar 
ferromagnets behave just like “He at the lambda point. Even without making 
use of renormalization group theory, these relationships can be understood 
with the aid of the following transformations®: the grand partition function of 
a gas can be approximately mapped onto that of a lattice gas which is equiva- 
lent to a magnetic Ising model (occupied/unoccupied cells = spin up/down). 
The Hamiltonian of a Bose liquid can be mapped onto that of a planar fer- 
romagnet. The gauge invariance of the Bose Hamiltonian corresponds to the 
two-dimensional rotational invariance of the planar ferromagnet. 


7.2 The Static Scaling Hypothesis’ 


7.2.1 Thermodynamic Quantities and Critical Exponents 


In this section, we discuss the analytic structure of the thermodynamic quan- 
tities in the vicinity of the critical point and draw from it typical conclu- 
sions about the critical exponents. This generally-applicable procedure will 
be demonstrated using the terminology of ferromagnetism. In the neighbor- 
hood of T., the equation of state according to Eq. (6.5.16) takes on the form 


® See e.g. M.E. Fisher, op. cit., and problem 7.16. 

” Although the so called scaling theory of critical phenomena can be derived micro- 
scopically through renormalization group theory (see Sect. 7.3.4), it is expedient 
for the following reasons to first introduce it on a phenomenological basis: (i) as 
a motivation for the procedures of renormalization group theory; (ii) as an il- 
lustration of the structure of scaling considerations for physical situations where 
field-theoretical treatments based on renormalization group theory are not yet 
available (e.g. for many nonequilibrium phenomena). Scaling treatments, starting 
from critical phenomena and high-energy scaling in elementary particle physics, 
have acquired a great influence in the most diverse fields. 
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1s 
iT Tm+ 3” (7.2.1) 


3 
which can be rearranged as follows: ae = sen(t 7) ie i172 + 3 i(—27) : 
Solving for m, we obtain the following dependence of m on 7 and h: 


h 
m(7, h) = [7] ?ms. (5) for T Z Te . (7.2.2) 
T 


The functions m4 for T 2 T. are determined by (7.2.1). In the vicinity of 
the critical point, the magnetization depends on 7 and h in a very special 
way: apart from the factor [7 1/?, it depends only on the ratio h/|r|?/. The 
magnetization is a generalized homogeneous function of 7 and h. This implies 
that (7.2.2) is invariant with respect to the scale transformation 


h— hb®?, 7 > 7b?, and m— mb. 


This scaling invariance of the physical properties expresses itself for example 
in the specific heat of tHe at the lambda point (Fig. 7.2). 

We know from Chap. 6 and Table 7.1 that the real critical exponents 
differ from their molecular-field values in (7.2.2). It is therefore reasonable to 
extend the equation of state (7.2.2) to arbitrary critical exponents®: 


h 
m(r,h) = |r|®ms (25) : (7.2.3) 


I7| 


in this expression, 3 and 6 are critical exponents and the m4 are called scaling 
functions. 

At the present stage, (7.2.3) remains a hypothesis; it is, however, possi- 
ble to prove this hypothesis using renormalization group theory, as we shall 
demonstrate later in Sect. 7.3, for example in Eq. (7.3.40’). For the present, 
we take (7.2.3) as given and ask what its general consequences are. The two 
scaling functions m+(y) must fulfill certain boundary conditions which follow 
from the critical properties listed in Eq. (6.5.31) and Table 7.1. The magne- 
tization is always oriented parallel to h when the applied field is nonzero and 
remains finite in the limit h — 0 below T., while above Ti, it goes to zero: 


8 In addition to being a natural generalization of molecular field theory, one can 
understand the scaling hypothesis (7.2.3) by starting with the fact that singu- 
larities are present only for 7 = 0 and h = 0. How strong the effects of the 
singularities will be depends on the distance from the critical point, 7, and on 
h/ |r|? 5 i.e. the ratio between the applied field and the field-equivalent of 7; that 
is h, = m®> = |r|®°. As long as h < hy, the system is effectively in the low- 
field limit and m ~ |7|°ms.(0). On the other hand, if r becomes so small that 
IT] < ni/é 5 then the influence of the applied field predominates. Any additional 
reduction of 7 produces no further change: m remains at the value which it had 


for |r| = h'/*9, ie. heme. (1). In the limit r > 0, ma(y) —> y® must hold, so 
that the singular dependence on 7 in m(7, h) cancels out. 
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Fig. 7.2. The specific heat at constant pressure, cp, at the lambda transition of 
“He. The shape of the specific heat stays the same on changing the temperature 
scale (1K to 107° K) 


= m_(y) =sgny, m4(0)=0. (7.2.4a) 
The thermodynamic functions are non-analytic precisely at 7 = 0,h = 0. For 
nonzero h, the magnetization is finite over the whole range of temperatures 
and remains an analytic function of 7 even for 7 = 0; the al dependence 
of (7.2.3) must be compensated by the function m+(h/ \r|°? ). Therefore, the 
two functions m+ must behave as 


lim ma(y) x y!/? (7.2.4b) 


yoo 


for large arguments. It follows from this that for 7 = 0, i.e. at the critical 
point, m ~ h!/°. The scaling functions m4(y) are plotted in Fig. 7.3. 

Eq. (7.2.3), like the molecular-field version of the scaling law given above, 
requires that the magnetization must be a generalized homogeneous function 
of 7 and fA and is therefore invariant with respect to scale transformations: 


h—o hb? 7 — rb", and m— mb?!” 


The name scaling law is derived from this scale invariance. Equation (7.2.3) 
contains additional information about the thermodynamics; by integration, 
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Fig. 7.3. The qualita 
tive behavior of the scal- 
ing functions m+ 


we can determine the free energy and by taking suitable derivatives we can 
find the magnetic susceptibility and the specific heat. From these we obtain 
relations between the critical exponents. For the susceptibility, we find the 
scaling law from Eq. (7.2.3): 


_(9m\ _| 6-58, 7 (_h 
x= ( an), =F mi.(—3) (7.2.5) 


and in the limit h — 0, we thus have y « |r|’ ~°8 Tt then follows that the 
critical exponent of the susceptibility, ~ (Eq. (6.5.31c)), is given by 


y= —B(1—6). (7.2.6) 
The specific free energy is found through integration of (7.2.3): 


h h/|r|9? 
f-fo=— |] dhm(s,h)= —[rie% f dx m+(x) . 
ho ho/|7|°° 
Here, ho must be sufficiently large so that the starting point for the inte- 
gration lies outside the critical region. The free energy then takes on the 
following form: 


A h 
f(r, h) = Faveneaae (=) aT freg : (7.2.7) 


In this expression, f is defined by the value resulting from the upper limit of 
the integral and freg is the non-singular part of the free energy. The specific 
heat at constant magnetic field is obtained by taking the second derivative 
of (7.2.7), 


Ch = 


s0F Ag|r|2C7-? + Bs . (7.2.8) 
Or? = re 

The A in this expression are amplitudes and the Bs come from the regular 

part. Comparison with the behavior of the specific heat as characterized by 

the critical exponent a (Eq. (6.5.31d)) yields 
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a=2-— (140). (7.2.9) 


The relations between the critical exponents are termed scaling relations, 
since they follow from the scaling laws for the thermodynamic quantities. If 
we add (7.2.6) and (7.2.9), we obtain 


y+28=2-a. (7.2.10) 


From (7.2.6) and (7.2.9), one can see that the remaining thermodynamic 
critical exponents are determined by ( and 6. 


7.2.2 The Scaling Hypothesis for the Correlation Function 


In the molecular field approximation, we obtained the Ornstein—Zernike be- 
havior in Eqns. (6.5.50) and (6.5.53’) for the wavevector-dependent suscepti- 
bility x(q) and the correlation function G(x): 


1 2 kT. vewl*l/é 
x(q) = eo , G(x) = eee with € = f9772 . (7.2.11) 

J@? 14+ (q€) An J |x| 
The generalization of this law is (q < a~',|x| >> a,€ > a with the lattice 
constant a): 


x(a) = > X(a8) Cla) = ay 6 (bee) ee 


qt 
(7.2.12a,b,c) 


where the functions ¥(gé) and G(|x|/€) are still to be determined. In 
(7.2.12c), we assumed that the correlation length € diverges at the critical 
point. This divergence is characterized by the critical exponent v. Just at 
T., € = co and therefore there is no longer any finite characteristic length; 
the correlation function G(x) can thus only fall off according to a power law 
G(x) ~ a G(0). The possibility of deviations from the 1/|x|-behavior of 


the Ornstein—Zernike theory was taken into account by introducing the ad- 
ditional critical exponent 7. In the immediate vicinity of T., € is the only 
relevant length and therefore the correlation function also contains the factor 
G(|x|/€). Fourier transformation of G(x) yields (7.2.12a) for the wavevector- 
dependent susceptibility, which for its part represents an evident general- 
ization of the Ornstein—Zernike expression. We recall (from Sects. 5.4.4 and 
6.5.5.2) that the increase of y(q) for small g on approaching T, leads to 
critical opalescence. 

In (7.2.11) and (7.2.12b), a three-dimensional system was assumed. Phase 
transitions are of course highly interesting also in two dimensions, and fur- 
thermore it has proved fruitful in the theory of phase transitions to consider 
arbitrary dimensions (even non-integral dimensions). We therefore generalize 
the relations to arbitrary dimensions d: 
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oe 1 
aa 


G(x) G(|x|/€) . (7.2:428") 


Equations (7.2.12a) and (7.2.12c) remain valid also in d dimensions, whereby 
of course the exponents v and 7 and the form of the functions G and x depend 


on the spatial dimension. From (7.2.12a) and (7.2.12b’) at the critical point 
we obtain 


for T=T.. (79:18) 


Here, we have assumed that G(0) and (oo) are finite, which follows from 
the finite values of G(x) at finite distances and of y(q) at finite wavenumbers 
(and € = co). 

We now consider the limiting case q — 0 for temperatures T 4 T.. Then 
we find from (7.2.12a) 


2-n 
x= Him x(q) « = = (7.2.14) 


This dependence is obtained on the basis of the following arguments: for fi- 
nite €, the susceptibility remains finite even in the limit q — 0. Therefore, 
the factor pa in (7.2.12a) must be compensated by a corresponding depen- 
dence of <(¢&), from which the relation (7.2.14) follows for the homogeneous 
susceptibility. Since its divergence is characterized by the critical exponent 
y according to (6.5.31c), it follows from (7.2.14) together with (7.2.12c) that 
there is an additional scaling relation 


y=v(2—7n). (7.2.15) 


Relations of the type (7.2.3), (7.2.7), and (7.2.12b’) are called scaling laws, 
since they are invariant under the following scale transformations: 


x—>alb, €—-€E/b, rob", h = hpes/v 


ml. fe Sf. GaGa, ey 

where f, stands for the singular part of the (specific) free energy. 

If we in addition assume that these scale transformations are based on 

a microscopic elimination procedure by means of which the original system 

with lattice constant a and N lattice sites is mapped onto a new system 

with the same lattice constant a but a reduced number Nb~¢ of degrees of 
freedom, then we find 


F, (1, h) _aFs(7b/”, hb?9/) 


oe ua ; (217) 


which implies the hyperscaling relation 
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2-a=da, (7.2.18) 


which also contains the spatial dimension d. According to equations (7.2.6), 
(7.2.9), (7.2.15), and (7.2.18), all the critical exponents are determined by 
two independent ones. 

For the two-dimensional Ising model one finds the exponents of the cor- 
relation function, vy = 1 and 7 = 1/4, from the exponents quoted following 
Eq. (6.5.31d) and the scaling relations (7.2.15) and (7.2.18). 


7.3 The Renormalization Group 


7.3.1 Introductory Remarks 


The term ‘renormalization’ of a theory refers to a certain reparametriza- 
tion with the goal of making the renormalized theory more easily dealt 
with than the original version. Historically, renormalization was developed by 
Stiickelberg and Feynman in order to remove the divergences from quantum- 
field theories such as quantum electrodynamics. Instead of the bare para- 
meters (masses, coupling constants), the Lagrange function is expressed in 
terms of physical masses and coupling coefficients, so that ultraviolet diver- 
gences due to virtual transitions occur only within the connection between 
the bare and the physical quantities, leaving the renormalized theory finite. 
The renormalization procedure is not unique; the renormalized quantities can 
for example depend upon a cutoff length scale, up to which certain virtual 
processes are taken into account. Renormalization group theory studies the 
dependence on this length scale, which is also called the “flow parameter”. 
The name “renormalization group” comes from the fact that two consecutive 
renormalization group transformations lead to a third such transformation. 

In the field of critical phenomena, where one must explain the observed 
behavior at large distances (or in Fourier space at small wavenumbers), it is 
reasonable to carry out the renormalization procedure by a suitable elimina- 
tion of the short-wavelength fluctuations. A partial evaluation of the partition 
function in this manner is easier to carry out than the calculation of the com- 
plete partition function, and can be done using approximation methods. As 
a result of the elimination step, the remaining degrees of freedom are subject 
to modified, effective interactions. 

Quite generally, one can expect the following advantages from such a 
renormalization group transformation: 


(i) The new coupling constants could be smaller. By repeated applications of 
the renormalization procedure, one could thus finally obtain a practically 
free theory, without interactions. 

(ii) The successively iterated coupling coefficients, also called “parameter 
flow”, could have a fixed point, at which the system no longer changes 
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under additional renormalization group transformations. Since the elimi- 
nation of degrees of freedom is accompanied by a change of the underlying 
lattice spacing, or length scale, one can anticipate that the fixed points 
are under certain circumstances related to critical points. Furthermore, it 
can be hoped that the flow in the vicinity of these fixed points can yield 
information about the universal physical quantities in the neighborhood 
of the critical points. 


The scenario described under (i) will in fact be found for the one-dimensional 
Ising model, and that described under (ii) for the two-dimensional Ising 
model. 

The renormalization group method brings to bear the scale invariance in 
the neighborhood of a critical point. In the case of so called real-space trans- 
formations (in contrast to transformation in Fourier space), one eliminates 
certain degrees of freedom which are defined on a lattice, and thus carries out 
a partial trace operation on the partition function. The lattice constant of the 
resulting system is then readjusted and the internal variables are renormal- 
ized in such a manner that the new Hamiltonian corresponds to the original 
one in its form. By comparison, one defines effective, scale-independent cou- 
pling constants, whose flow behavior is then investigated. We first study the 
one-dimensional Ising model and then the two-dimensional. Finally, the gen- 
eral structure of such transformations will be discussed with the derivation 
of scaling laws. A brief schematic treatment of continuous field-theoretical 
formulations will be undertaken following the Ginzburg-Landau theory. 


7.3.2 The One-Dimensional Ising Model, Decimation 
Transformation 


We will first illustrate the renormalization group method using the one- 
dimensional Ising model, with the ferromagnetic exchange constant J in zero 
applied field, as an example. The Hamiltonian is 


H=-JIS ons, (7.3.1) 
I 


where | runs over all the sites in the one-dimensional chain; see Fig. 7.4. We 
introduce the abbreviation K = J/kT into the partition function for N spins 
with periodic boundary conditions oy 41 = 01, 


Lin =e ere, <i eee (732) 
{oi=+1} 


The decimation procedure consists in partially evaluating the partition func- 
tion, by carrying out the sum over every second spin in the first step. In 
Fig. 7.4, the lattice sites for which the trace is taken are marked with a cross. 
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Fig. 7.4. An Ising chain; the trace is carried out over all the lattice points which 
are marked with a cross. The result is a lattice with its lattice constant doubled 


A typical term in the partition function is then 


Be eKor(r-1 +143) — 2 cosh K(o1-1 + o141) = e79tK't-1742 (7.3.3) 


o=t1 


with coefficients g and K’ which are still to be determined. Here, we have 
taken the sum over o; = +1 after the first equals sign. Since cosh K(oj-1 + 
0141) depends only on whether o;-1 and o;4, are parallel or antiparallel, the 
result can in any case be brought into the form given after the second equals 
sign. The coefficients g and K’ can be determined either by expansion of the 
exponential function or, still more simply, by comparing the two expressions 
for the possible orientations. If oj-1 = —o141, we find 


2=e29-K' (7.3.4a) 
and if oj, = 07141, the result is 
2cosh2K = e29tK" | (7.3.4b) 


From the product of (7.3.4a) and (7.3.4b) we obtain 4cosh2K = e49, and 
from the quotient, cosh2K = e?* ; thus the recursion relations are: 


1 

KS 5 log cosh 2K (7.3.5a) 
1 

5 (log 2+ K’) . (7.3.5b) 


Repeating this decimation procedure a total of & times, we obtain from 
(7.3.5a,b) for the kth step the following recursion relation: 


1 
K®) = > log (cosh 2K-1)) (7.3.6a) 

1 1 
gQ(K) = 5 log2+ Ae (7.3.6b) 


The decimation produces another Ising model with an interaction between 
nearest neighbors having a coupling constant K(*), Furthermore, a spin- 
independent contribution g(K*)) to the energy is generated; in the kth step, 
it is given by (7.3.6b). 

In a transformation of this type, it is expedient to determine the fixed 
points which in the present context will prove to be physically relevant. Fixed 
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points are those points K* which are invariant with respect to the transfor- 
mation, i.e. here K* = 3 log(cosh 2K*). This equation has two solutions, 


K*=0 (T=oo) and K*=o0 (T=0). (7.3.7) 


The recursion relation (7.3.6a) is plotted in Fig. 7.5. Starting with the initial 
value Ko, one obtains K'(Ko), and by a reflection in the line kK’ = K, 
K'(K'(Ko)), and so forth. One can see that the coupling constant decreases 
continually; the system moves towards the fixed point K* = 0, ie. a non- 
interacting system. Therefore, for a finite Ko, we never arrive at an ordered 
state: there is no phase transition. Only for K = oo, ie. for a finite exchange 
interaction J and T = 0, do the spins order. 


K'(K) = } log cosh 2K 


Fig. 7.5. The recur- 
sion relation for the one- 
dimensional Ising model 
with interactions between 
nearest neighbors (heavy 
solid curve), the line K’ = 
kK (dashed), and the itera- 
tion steps (thin lines with 
arrows) 


Making use of this renormalization group (RG) transformation, we can 
calculate the partition function and the free energy. The partition function 
for all together N spins with the coupling constant K, using (7.3.3), is 


Zn (K) = eNO Z y (K!) = NR) 2 HRY y (KN), (7.3.8) 


2: 


and, after the nth step, 
as 
Zy(K) = exp [w sea K) +logZy (K™)| . (7.3.9) 
k=1 


The reduced free energy per lattice site and kT is defined by 


i= ~~ log Zn(K). (7.3.10) 
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As we have seen, the interactions become weaker as a result of the renor- 
malization group transformation, which gives rise to the following possible 
application: after several steps the interactions have become so weak that 
perturbation-theory methods can be used, or the interaction can be altogether 
neglected. Setting K‘” = 0, from (7.3.9) we obtain the approximation: 


n 


s 1 1 
(PH =->, aoe”) “255 OBZ; (7.3.11) 
k=1 


since the free energy per spin of a field-free spin-1/2 system without inter- 
actions is — log 2. Fig. 7.6 shows FO (BK) for n = 1 to 5. We can see how 
quickly this approximate solution approaches the exact reduced free energy 
f(K) = —log(2cosh K). The one-dimensional Ising model can be exactly 
solved by elementary methods (see problem 6.9), as well as by using the 
transfer matrix method, cf. Appendix F. 


Fig. 7.6. The reduced 
free energy of the one- 
dimensional Ising model. 
f is the exact free energy, 
FY, FO)... are the ap- 
proximations (7.3.11) 


7.3.3 The Two-Dimensional Ising Model 


The application of the decimation procedure to the two-dimensional Ising 
model is still more interesting, since this model exhibits a phase transition 
at a finite temperature T, > 0. We consider the square lattice rotated by 45° 
which is illustrated in Fig. 7.7, with a lattice constant of one. 

The Hamiltonian multiplied by 6, H = GH, is 


H=-)Y > Kow;, (7.3.12) 


n.n. 
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. e e . a 
aK 
x x x an is 
aS .93 . é Fig. 7.7. A square spin 
lattice, rotated by 45°. The 
. uG x - lattice sites are indicated 
by points. In the decima- 
tion transformation, the spins 
e *G. °G . . v4 . : 
1 2 at the sites which are also 
- fi ee go Ny marked by a cross are elimi- 
nated. K is the interaction be- 
: 7 . - ral . tween nearest neighbors and 
L is the interaction between 
— v2 = next-nearest neighbors 


where the sum runs over all pairs of nearest neighbors (n.n.) and K = J/kT. 
When in the partial evaluation of the partition function the trace is taken 
over the spins marked by crosses, we obtain a new square lattice of lattice 
constant /2. How do the coupling constants transform? We pick out one 
of the spins with a cross, 0, denote its neighbors as 01,02,03, and o4, and 
evaluate their contribution to the partition function: 


S ek (eite2to3toa)o = 


o=2t1 (7.3.13) 
= eh + 3K (o102...t0304) +L! (o103+0204)+M'o1020304 


elos(2 cosh K(o1+02+03+04)) 


This transformation (taking a partial trace) yields a modified interaction be- 
tween nearest neighbors, K’ (here, the elimination of two crossed spins con- 
tributes); in addition, new interactions between the next-nearest neighbors 
(such as 0; and o3) and a four-spin interaction are generated: 


H'=(A 4K! Soiej tL! SO ovaj +...). (7.3.12) 
n.N. "un.N. 


The coefficients A’, A’, L’ and M’ can readily be found from (7.3.13) as func- 
tions of K, by using oj? =1,i=1,...,4 (see problem 7.2): 


1 
A’(K) = log2+ g tog cosh 44° + 4logcosh2K} , (7.3.14) 
1 1 
Ks z loscosh 4k , L'(K) = 5 K(k) (7.3.13) 


1 
M'(K) = g {log cosh4K — 4log cosh 2K} . 


Putting the critical value K. = J/kT, = 0.4406 (exact result?) into this 
relation as an estimate for the initial value K, we find M’ < L’ < Kk’. In 


° The partition function of the Ising model on a square lattice without an external 
field was evaluated exactly by L. Onsager, Phys. Rev. 65, 117 (1944), using the 
transfer matrix method (see Appendix F.). 
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the first elimination step, the original Ising model is transformed into one 
with three interactions; in the next step we must take these into account and 
obtain still more interactions, and so on. In a quantitatively usable calculation 
it will thus be necessary to determine the recursion relations for an extended 
number of coupling constants. Here, we wish only to determine the essential 
structure of such recursion relations and to simplify them sufficiently so that 
an analytic solution can be found. Therefore, we neglect the coupling constant 
M’ and all the others which are generated by the elimination procedure, and 
restrict ourselves to K’ and L’ as well as their initial values K and L. This 
is suggested by the smallness of M’ which we mentioned above. 

We now require the recursion relation including the coupling constant LD, 
which acts between o; and o4, etc. Thus, expanding (7.3.13’) up to second 
order in K and taking note of the fact that an interaction L between next- 
nearest neighbors in the original Hamiltonian appears as a contribution to 
the interactions of the nearest neighbors in the primed Hamiltonian, we find 
the following recursion relations on elimination of the crossed spins (Fig. 7.7): 


K'=2K? +L (7.3.15a) 
Peak? (7.3.15b) 


These relations can be arrived at intuitively as follows: the spin o mediates 
an interaction of the order of K times K, i.e. K? between o; and 3, likewise 
the crossed spin just to the left of o. This leads to 2K? in K’. The interac- 
tion DL between next-nearest neighbors in the original model makes a direct 
contribution to K’. Spin o also mediates a diagonal interaction between o; 
and 04, leading thus to the relation L’ = K? in (7.3.15b). 

However, it should be clear that in contrast to the one-dimensional case, 
new coupling constants are generated in every elimination step. One cannot 
expect that these recursion relations, which have been restricted as an ap- 
proximation to a reduced parameter space (K, L), will yield quantitatively 
accurate results. They do contain all the typical features of this type of re- 
cursion relations. 

In Fig. 7.8, we have shown the recursion relations (7.3.15a,b)!°. Starting 
from values (K,0), the recursion relation is repeatedly applied, likewise for 
initial values (0, L). The following picture emerges: for small initial values, the 
flux lines converge to K = L = 0, and for large initial values they converge 
to K = L=oo. These two regions are separated by two lines, which meet at 
Ki= $ and L* = z. Further on it will become clear that this fixed point is 
connected to the critical point. 

We now want to investigate analytically the more important properties of 
the flow diagram which follows from the recursion relations (7.3.15a,b). As a 


1° For clarity we have drawn in only every other iteration step in Fig. 7.8. We will 
return to this point at the end of this section, after investigating the analytic 
behavior of the recursion relation. 
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Fig. 7.8. A flow diagram 
of Eq. (7.3.15a,b) (only every 
other point is indicated.) Three 
fixed points can be recognized: 
Kk* = L* =0,K* = L* = 00 
1/3 1 K and Ki =1,1% = 


first step, the fixed points must be determined from (7.3.15a,b), i.e. K* and 
L*, which obey K* = 2K** + L* and L* = K*. These conditions give three 
fixed points 


1 
(i) K*=L*=0, (ii) K*=L"=00, and (iii) K?= 5, Lo= 


The high-temperature fixed point (i) corresponds to a temperature T = oo 
(disordered phase), while the low-temperature fixed point (ii) corresponds to 
T =0 (ordered low-temperature phase). The critical behavior can be related 
only to the non-trivial fixed point (iii), (AzZ,L%) = (4, 3). 

That the initial values of K and L which lead to the fixed point (Kz, L*) 
represent critical points can be seen in the following manner: the RG trans- 
formation leads to a lattice with its lattice constant increased by a factor of 
V2. The correlation length of the transformed system €’ is thus smaller by a 
factor of V2: 


é=€/V2. (7.3.17) 


However, at the fixed point, the coupling constants K7, L* are invariant, so 
that for € of the fixed point, we have ¢’ = € , i.e. at the fixed point, it follows 
that € = £/,/2, thus 


Co or 
arf . (7.3.18) 


The value 0 corresponds to the high-temperature and to the low-temperature 
fixed points. At finite K*,L*, € cannot be zero, but only oo. Calculating 
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back through the transformation shows that the correlation length at each 
point along the critical trajectory which leads to the fixed point is infinite. 
Therefore, all the points of the “critical trajectory”, i.e. the trajectory leading 
to the fixed point, are critical points of Ising models with nearest-neighbor 
and next-nearest-neighbor interactions. 

In order to determine the critical behavior, we examine the behavior of 
the coupling constants in the vicinity of the “non-trivial” fixed point; to this 
end, we linearize the transformation equations (7.3.15a,b) around (Kz, L*) 
in the [th step: 


6k, =hi-Kr , 6bb,;=L,-L.. (7.3.19) 
We thereby obtain the following linear recursion relation: 
OK) 4K21\ (6Ki-1 $1\ (Kin 
= = ; (7.3.20) 
6Ly 2K* 0) \6Li-1 20) \6Li-1 


The eigenvalues of the transformation matrix can be determined from 
W- FA-3=0, ie. 


1.7208 
2+ Vi0) = 7.3.21 
( ) ee ey) 


The associated eigenvectors can be obtained from (4 —(2+Vv 10)) 6k +36L=0, 


i.e. 


65L=+~ — Ace apne ende 
T2302. Ub: 
V10—2 V10+2 ( ) 
e, = (1, 3 and e2=(1,- 5 


with the scalar product e1 - eg = 3 . 


We now start from an Ising model with coupling constants Ko and Lo 
(including the division by kT’). We first expand the deviations of the ini- 
tial coupling constants Ko and Lo from the fixed point in the basis of the 
eigenvectors (7.3.21): 


K K* 
ee = ey + cyey + C22 , (7.3.22) 


with expansion coefficients cj and cp. The decimation procedure is repeated 
several times; after / transformation steps, we obtain the coupling constants 
ky and Ly : 


kK Ke 
') = ec y+ Neer + Ayc2e2 Z (7.3.23) 
Ly L* 


If the Hamiltonian H differs from H* only by an increment in the direction 
€2, the successive application of the renormalization group transformation 
leads to the fixed point, since |A2| < 1 (see Fig. 7.9). 
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L 


Fig. 7.9. Flow diagram 

based on the recursion re- 

lation (7.3.22), which is 

K i linearized around the non- 
Gg trivial fixed point (FP) 


Let us now consider the original nearest-neighbor Ising model with the 
coupling constant Ko = iz and with Lo = 0, and first determine the critical 
value K;; this is the value of Ko which leads to the fixed point. The condition 
for K., from the above considerations, is given by 


K. $ 1 
ee =(})+0-e+e(_ vinse | (7.3.24) 
9 3 
These two linear equations have the solution 
: d theref K a : 0.3979 . (7.3.25) 
co = —————.,_ and therefore ¢ = = +——=— _ = 0. . (7.3. 
*3(/ 1042) 3" 3(/10 + 2) 
For Ko = K¢, the linearized RG transformation leads to the fixed point, 
i.e. this is the critical point of the nearest-neighbor Ising model, kK. = on 


From the nonlinear recursion relation (7.3.15a,b), we find for the critical point 
the slighty smaller value K”-!- = 0.3921. Both values differ from Onsager’s 
exact solution, which gives K, = 0.4406, but they are much closer than the 
value from molecular field theory, Kk, = 0.25. 

For Ko = K., only cg # 0, and the transformation leads to the fixed 
point. For Ko # Ke, we also have cy « (Ko — Ke) = —pa(T — Te): £0. 
This increases with each application of the RG transformation, and thus leads 
away from the fixed point (7, L*) (Fig. 7.9), so that the flow runs either 
to the low-temperature fixed point (for T < T.) or to the high-temperature 
fixed point (for T > T.). 

Now we may determine the critical exponent v for the correlation length, 
beginning with the recursion relation 


(K — K.)' =\(K — K-) (7.3.26) 
and writing ; as a power of the new length scale 
Ar = (V2) (7.3.27) 


For the exponent y; defined here, we find the value 
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log Ay 
log 2 


From ¢’ = €//2 (Eq. (7.3.17)), it follows that (K’— K.)~” = (K—K,)-"/v2, 
i.e. 


yy =2 = 1.566. (7.3.28) 


jl 
(K’ — K.) = (V2)"(K — K,). (7.3.29) 
Comparing this with the first relation (7.3.26), we obtain 
1 
v= — = 0.638 . (7.3.30) 
Y1 


This is, to be sure, quite a ways from 1, the known exact value of the two- 
dimensional Ising model, but nevertheless it is larger than 0.5, the value 
from the molecular-field approximation. A considerable improvement can be 
obtained by extending the recursion relation to several coupling coefficients. 

Let us now consider the effect of a finite magnetic field h (including the 
factor 3). The recursion relation can again be established intuitively. The 
field h acts directly on the remaining spins, as well as a (somewhat underes- 
timated) additional field Kh which is due to the orienting action of the field 
on the eliminated neighboring spins, so that all together we have 


hi =h+Kh. (7.3.31) 


The fixed point value of this recursion relation is h* = 0. Linearization around 
the fixed point yields 


4 
h’=(1+ K*)h= 3” : (7.3.32) 
thus the associated eigenvalue is 
4 
An = 5 (7.3.33) 


Ko—K-. (or T—T, ) and h are called the relevant “fields”, since the eigenval- 
ues A; and A, are larger than 1, and they therefore increase as a result of the 
renormalization group transformation and lead away from the fixed point. In 
contrast, c2 is an “irrelevant field”, since |A2| < 1, and therefore cp becomes 
increasingly smaller with repeated RG transformations. Here, “fields” refers 
to fields in the usual sense, but also to coupling constants in the Hamilto- 
nian. The structure found here is typical of models which describe critical 
points, and remains the same even when one takes arbitrarily many coupling 
constants into account in the transformation: there are two relevant fields 
(T — T, and h, the conjugate field to the order parameter), and all the other 


fields are irrelevant. 

We add a remark concerning the flow diagram 7.9. There, owing to the negative sign 
of A2, only every other point is shown. This corresponds to a twofold application of 
the transformation and an increase of the lattice constant by a factor of 2, as well as 
1 — AZ, A2 — AZ. Then the second eigenvalue A3 is also positive, since otherwise 
the trajectory would move along an oscillatory path towards the fixed point. 
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7.3.4 Scaling Laws 


Although the decimation procedure described in Sect. 7.3.3 with only a few 
parameters does not give quantitatively satisfactory results and is also un- 
suitable for the calculation of correlation functions, it does demonstrate the 
general structure of RG transformations, which we shall now use as a starting 
point for deriving the scaling laws. 

A general RG transformation R maps the original Hamiltonian H onto a 
new one, 


HH! = RH. (7.3.34) 


This transformation also implies the rescaling of all the lengths in the prob- 
lem, and that N’ = Nb~¢ holds for the number of degrees of freedom N in d 
dimensions (here, b = V2 for the decimation transformation of 7.3.1). 

The fixed-point Hamiltonian is determined by 


R(H*) = H*. (7.3.35) 
For small deviations from the fixed-point Hamiltonian, 
R(H* + 6H) =H* + L5H , 


we can expand in terms of the deviation 6H. From the expansion, we obtain 
the linearized recursion relation 


LOH = 5H . (7.3.36a) 


The eigenoperators 6H{1, dH2,... of this linear transformation are determined 
by the eigenvalue equation 


LEH; = 0H; - (7.3.36b) 


A given Hamiltonian H, which differs only slightly from H*, can be repre- 
sented by H{* and the deviations from it: 


H =H" + 76H, + hOHn + S > cidH: , (7.3.37) 
i>3 
where 6H, and dH; denote the two relevant perturbations with 
|A;| = 68 > 1, |Ap| = b% > 1; (7.3.38) 
they are related to the temperature variable 7 = tt and the external 


field h, while |A;| = 6% < 1 and thus y,; < 0 for j > 3 are connected with 
the irrelevant perturbations.'' The coefficients 7, h, and c; are called scaling 


"™ Compare the discussion following Eq. (7.3.33). The (only) irrelevant field there 
is denoted by cz. In the following, we assume that A; > 0. 
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fields. For the Ising model, 6H; = >>) 01. Denoting the initial values of the 
fields by c;, we find that the free energy transforms after | steps to 


Fy (ci) = Frypa(cidj) - (7.3.39a) 
For the free energy per spin, 
1 
Fle) = 7 Fw(ei) 5 (7.3.39b) 
we then find in the linear approximation 
Fhe fel i eo) (7.3.40) 


Here, we have left off an additive term which has no influence on the following 
derivation of the scaling law; it is, however, important for the calculation of 
the free energy. The scaling parameter / can now be chosen in such a way 
that |r|b¥*! = 1, which makes the first argument of f equal to +1. Then we 
find 


f(t hy eg...) = [rl fa (|r|! , cal!) , (7.3.40) 


where fe(z,y,-.-) = f(tl,z,y,...) and y-,yn > 0,y3,... < 0. Close to 
T., the dependence on the irrelevant fields c3,... can be neglected, and 
Eq. (7.3.40’) then takes on precisely the scaling form (Eq. 7.2.7), with the 
conventional exponents 


B6 = Yn/Yr (7.3.41a) 
and 
2-a= “ (7.3.41b) 


Taking the derivative with respect to h yields 


d— d—2 
= Yn and pee ai Le 
Ur Ur 


We have thus derived the scaling law, Eq. (7.2.7), within the RG theory for 
fixed points with just one relevant field, along with the applied magnetic field 
and the irrelevant operators. Furthermore, the dependence on the irrelevant 
fields cg,... gives rise to corrections to the scaling laws, which must be taken 
into account for temperatures outside the asymptotic region. 

In order to make the connection between y, and the exponent v, we recall 
that | iterations reduce the correlation length to ¢’ = b~'€, which implies that 
(rbY7")—” = b-'7—” and, as a result, 


B (7.3.41c,d) 


y= — (7.3.41e) 
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Fig. 7.10. The critical hypersur- 
face. A trajectory within the crit- 
ical hypersurface is shown as a 
dashed curve. The full curve is a 
trajectory near the critical hyper- 
surface. The coupling coefficients 
of a particular physical system as 
' a function of the temperature are 
! indicated by the long-dashed curve 


(cf. Eq. (7.3.30) for the two-dimensional Ising model). From the existence of 
a fixed-point Hamiltonian with two relevant operators, the scaling form of 
the free energy can be derived, and it is also possible to calculate the critical 
exponents. Even the form of the scaling functions 3 and m can be computed 
with perturbation-theoretical methods, since the arguments are finite. A sim- 
ilar procedure can be applied to the correlation function, Eq. (7.2.12b’). At 
this point it is important to renormalize the spin variable, a’ = bSa, whereby 
it is found that setting the value 


¢ =(d—-2+7)/2 (7.3.41f) 


guarantees the validity of (7.2.13) at the critical point. 

We add a few remarks about the generic structure of the flow diagram in 
the vicinity of a critical fixed point (Fig. 7.10). In the multidimensional space 
of the coupling coefficients, there is a direction (the relevant direction) which 
leads away from the fixed point (we assume that h = 0). The other eigen- 
vectors of the linearized RG transformation span the critical hypersurface. 
Further away from the fixed point, this hypersurface is no longer a plane, 
but instead is curved. The trajectories from each point on the critical hyper- 
surface lead to the critical fixed point. When the initial point is close to but 
not precisely on the critical hypersurface, the trajectory at first runs parallel 
to the hypersurface until the relevant portion has become sufficiently large 
so that finally the trajectory leaves the neighborhood of the critical hyper- 
surface and heads off to either the high-temperature or the low-temperature 
fixed point. For a given physical system (ferromagnet, liquid, ...), the param- 
eters T,c3,... depend on the temperature (the long-dashed curve in Fig. 7.10). 
The temperature at which this curve intersects the critical hypersurface is 
the transition temperature Ty. 

From this discussion, the universality properties should be apparent. All 
systems which belong to a particular part of the parameter space, i.e. to the 
region of attraction of a given fixed point, are described by the same power 
laws in the vicinity of the critical hypersurface of the fixed point. 
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*7.3.5 General RG Transformations in Real Space 


A general RG transformation in real space maps a particular spin system {co} 
with the Hamiltonian H{o}, defined on a lattice, onto a new spin system with 
fewer degrees of freedom (by N’/N = b~¢) and a new Hamiltonian H’{o’}. 
It can be represented by a transformation T{o’,o}, such that 


eC H0'} = ST fa! ape M43 (7.3.42) 
{o} 

with the conditions 
Se (7.3.43a) 
{o’} 

and 
So T{o',o} =1, (7.3.43b) 
{o’} 

which guarantee that 


—H'{o'} = 


eS Tr rye Tr gaye tb} (7.3.44a) 


is fulfilled (Tr ¢,} = )7¢,})- This yields a relation between the free energy F 
of the original lattice and the free energy F” of the primed lattice: 


F4+G=F. (7.3.44b) 


The constant G is independent of the configuration of the {o’} and is deter- 
mined by equation (7.3.43a). 

Important examples of such transformations are decimation transforma- 
tions, as well as linear and nonlinear block-spin transformations. The simplest 
realization consists of 


1 
T{o', a} = Hyea's (1 + ou tiv(o)) ; (7.3.45) 


where 2 denotes the lattice sites of the initial lattice and §2’ those of the new 
lattice, and the function t,(o0) determines the nature of the transformation. 
a) Decimation Transformation (Fig. 7.11) 


ty{o} = Cox 
b= v2 , ¢= pd-2+n)/2 


where ¢ rescales the amplitude of the remaining spins. 
Then, 


(7.3.46a) 


(a200) = C(o%, 09) - 
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» » 


Fig. 7.11. A decimation transforma- 
tion 


3) Linear Block-Spin Transformation (on a triangular lattice, Fig.7.12) 


ty{a} = p(o}, + 02 + 03) 


1 


7.3.46b 
b= V3, p43) Sas, ( ) 


aA 


mK 
Ling, 
WAN 


Fig. 7.12. A block-spin transformati- 
on 


) Nonlinear Block-Spin Transformation 
ty{o} = plop +02 +03) + qopope3 . (7.3.46c) 


An important special case 


P= d= 55 % =sen(oy + oy +07). 

These so called real-space renormalization procedures were introduced by 
Niemeijer and van Leeuwen!?. The simplified variant given in Sect. 7.3.3 is 
from!3. The block-spin transformation for a square Ising lattice is described 
in!*. For a detailed discussion with additional references, we refer to the 
article by Niemeijer and van Leeuwen!». 

2 Th. Niemeijer and J.M. J. van Leeuwen, Phys. Rev. Lett. 31, 1411 (1973). 

3 K.G. Wilson, Rev. Mod. Phys. 47, 773 (1975). 

‘4 M. Nauenberg and B. Nienhuis, Phys. Rev. Lett. 33, 344 (1974). 

'® Th. Niemeijer and J.M. J. van Leeuwen, in Phase Transitions and Critical Phe- 
nomena Vol. 6, Eds. C. Domb and M.S. Green, p. 425, Academic Press, London 
1976. 
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“7.4 The Ginzburg-Landau Theory 


7.4.1 Ginzburg—Landau Functionals 


The Ginzburg-Landau theory is a continuum description of phase transitions. 
Experience and the preceding theoretical considerations in this chapter show 
that the microscopic details such as the lattice structure, the precise form of 
the interactions, etc. are unimportant for the critical behavior, which mani- 
fests itself at distances which are much greater than the lattice constant. Since 
we are interested only in the behavior at small wavenumbers, we can go to a 
macroscopic continuum description, roughly analogous to the transition from 
microscopic electrodynamics to continuum electrodynamics. In setting up the 
Ginzburg-Landau functional, we will make use of an intuitive justification; a 
microscopic derivation is given in Appendix E. (see also problem 7.15). 

We start with a ferromagnetic system consisting of Ising spins (n = 1) on 
a d-dimensional lattice. The generalization to arbitrary dimensions is interest- 
ing for several reasons. First, it contains the physically relevant dimensions, 
three and two. Second, it may be seen that certain approximation methods 
are exact above four dimensions; this gives us the possibility of carrying out 
perturbation expansions around the dimension four (Sect. 7.4.5). 

Instead of the spins S; on the lattice, we introduce a continuum magne- 
tization 


m(x) = Kad d g(x — x1) S) . (7.4.1) 


Here, g(x — x;) is a weighting function, which is equal to one within a cell 
with N spins and is zero outside it. The linear dimension of this cell, a,., is 
supposed to be much larger than the lattice constant ag but much smaller 
than the length L of the crystal, i.e. a9 < ae < L. The function g(x — x:) 
is assumed to vary continuously from the value 1 to 0, so that m(x) is a 
continuous function of x; see Fig. 7.13. 


g(y) 


Fig. 7.13. The weighting function 
g(y) along one of the d cartesian co- 
a ordinates 


Making use of 


[ateatx —x,) = Nad 
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and of the definition (7.4.1), we can rewrite the Zeeman term as follows: 


S- hS, = hyS- aa | tioatx — x;)S, = [ atch) ‘ (7.4.2) 
l G80 


From the canonical density matrix for the spins, we obtain the probability 
density for the configurations m(x). Generally, we have 


Plm(x)] = (6(m(x) — — So g(x —x)51)). (7.4.3) 
0 1 


For P[m(x)], we write 
Plm(x)] oc e~FlMOOMAT (7.4.4) 


in which the Ginzburg-Landau functional F[m(x)] enters; it is a kind of 
Hamiltonian for the magnetization m(x). The tendency towards ferromag- 
netic ordering due to the exchange interaction must express itself in the form 
of the functional F[m(x)] 


Fi[m(x)| = as (aim?(x) + sm*(x) + c(Vm(x))* - hmn(x)) . (7.4.5) 
In the vicinity of T., only configurations of m(x) with small absolute values 
should be important, and therefore the Taylor expansion (7.4.5) should be 
allowed. Before we turn to the coefficients in (7.4.5), we make a few remarks 
about the significance of this functional. 

Due to the averaging (7.4.1), short-wavelength variations of S; do not con- 
tribute to m(x). The long-wavelength variations, however, with wavelengths 
larger than az, are reflected fully in m(x). The partition function of the mag- 
netic system therefore has the form 


Z = Z)(T) ‘| D[m(x)Je77 Oo /kr (7.4.6) 


Here, the functional integral { D[m(x)]... refers to a sum over all the pos- 
sible configurations of m(x) with the probability density e~71™@)l/FT, One 
can represent m(x) by means of a Fourier series, obtaining the sum over 
all configurations by integration over all the Fourier components. The fac- 
tor Zo(T) is due to the (short-wavelength) configurations of the spin system, 
which do not contribute to m(x). The evaluation of the functional integral 
which occurs in the partition function (7.4.6) is of course a highly nontrivial 
problem and will be carried out in the following Sections 7.4.2 and 7.4.5 using 
approximation methods. The free energy is 


F =—kTlogZ. (7.4.7) 
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We now come to the coefficients in the expansion (7.4.5). First of all, this 
expansion took into account the fact that F[m(x)] has the same symmetry as 
the microscopic spin Hamiltonian, i.e. aside from the Zeeman term, F[m/(x)] 
is an even function of m(x). Owing to (7.4.2), the field h expresses itself only 
in the Zeeman term, — { d“x hm(x), and the coefficients a,b,c are indepen- 
dent of h. For reasons of stability, large values of m(x) must have a small 
statistical weight, which requires that b > 0. If for some system b < 0, the 
expansion must be extended to higher orders in m(x). These circumstances 
occur in first-order phase transitions and at tricritical points. The ferromag- 
netic exchange interaction has a tendency to orient the spins uniformly. This 
leads to the term cVmVm with c > 0, which suppresses inhomogeneities in 
the magnetization. 

Finally, we come to the values of a. For h = 0 and a uniform m(x) = m, 
the probability weight e~°7 is shown in Fig. 7.14. 


a>0 | e* a<0] ef 


m m 


Most Probable State 
m=0 mz#0 


(a) (b) 


Fig. 7.14. The probability density e~°7 as a function of a uniform magnetization. 
(a) Fora > 0 (Tf > T?) and (b) fora <0 (T < T?) 


When a > 0, then the most probable configuration is m = 0; when a < 0, 
then the most probable configuration is m 4 0. Thus, a must change its sign, 


a=a(T-T), (7.4.8) 


in order for the phase transition to occur. Due to the nonlinear terms and to 
fluctuations, the real T, will differ from T°. The coefficients b and c are finite 
at Te. 

If one starts from a Heisenberg model instead of from an Ising model, the 
replacements 


SiS; and m(x) — m(x) 


m*(x) + (m(x)?) (Vm)? > VemVam . (7.4.9) 


must be made, leading to Eq. (7.4.10). Ginzburg-Landau functionals can 
be introduced for every type of phase transition. It is also not necessary to 
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attempt a microscopic derivation: the form is determined in most cases from 
knowledge of the symmetry of the order parameter. Thus, the Ginzburg— 
Landau theory was first applied to the case of superconductivity long before 
the advent of the microscopic BCS theory. The Ginzburg-Landau theory was 
also particularly successful in treating superconductivity, because here simple 
approximations (see Sect. 7.4.2) are valid even close to the transition (see also 
Sect. 7.4.4). 


7.4.2 The Ginzburg-Landau Approximation 


We start with the Ginzburg-Landau functional for an order parameter with 
n components, m(x), n = 1,2,...,: 


F(m(x)| = | de [amm? (xe) + 56(m(x)2)” + o( Vm)? — h(x) (x) . (7.4.10) 


The integration extends over a volume L%. The most probable configuration 
of m(x) is given by the stationary state which is determined by 


OF 


me ee ee Ge (7.4.11) 


Let h be independent of position and let us take h to lie in the x-direction 
without loss of generality, h = hey,(h 2 0); then the uniform solution is 
found from 


2(a + bm”)m — he; = 0. (7.4.12) 


We discuss special cases: 


(i) h +0: spontaneous magnetization and specific heat 
When there is no applied field, (7.4.12) has the following solutions: 


m=0 for a>0O 


= (7.4.13) 
(m=0) and m=+eim, m=\/— for a<0O. 
The (Gibbs) free energy for the configurations (7.4.13) is!® 
F(T, h =0) = F[0] =0 for, TS 7? (7.4.14a) 
1 2 
F(T,h =0) = Fim] = ee for T<T°. (7.4.14b) 


'6 Instead of really computing the functional integral f Dim(x)Je77 be Ool/kP as is 


required by (7.4.6) and (7.4.7) for the determination of the free energy, m(x) 
was replaced everywhere by its most probable value. 
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We will always leave off the regular term Fyeg = —kT log Zp. The state m = 0 
would have a higher free energy for T < T° than the state mo; therefore, 
m = 0 was already put in parentheses in (7.4.13). For T < T°, we thus 
find a finite spontaneous magnetization. The onset of this magnetization is 
characterized by the critical exponent (3, which here takes on the value 3 = s 
(Fig. 7.15). 


Fig. 7.15. The spontaneous magnetization 
in the Ginzburg-Landau approximation 


Specific Heat 
From (7.4.14a,b), we immediately find the specific heat 


Os OF 0 i hee! 
L*¢,=9 = T| — = -T| —— = j © (tAd5 
a Ge eae ee ae 


with a’ from (7.4.8). The specific heat exhibits a jump 
al” 
Agi = = ‘ (7.4.16) 
and the critical exponent a is therefore zero (see Eq. (7.1.1)), a = 0. 


(ii) The equation of state for h > 0 and the susceptibility 
We decompose m into a longitudinal part, e1m 1, and a transverse part, 
m_ = (0,mg,...,mn). Evidently, Eq. (7.4.12) gives 


m, =0 (7.4.17) 


and the magnetic equation of state 


h=2(atbmi)m, . (7.4.18) 
We can simplify this in limiting cases: 
a). P= Te 
h=2bm? ie. 6=3. (7.4.19) 
fe) Na dae Bs 
h 3 
my = — +O(h3). (7.4.20) 
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anh. 


my, = mosgn(h)+ Am _ yields 


my, = mo sgn (h) 4 + O(h?sgn(h)) 


Abme 


= mo segn(h) + — + O(h?sgn(h)) . (7.4.21) 


We can now also calculate the magnetic susceptibility for h = 0, either by 
differentiating the equation of state (7.4.18) 


Om, 


Oh 


or directly, by inspection of (7.4.20) and (7.4.21). It follows that the isother- 
mal susceptibility is given by 


“a = )% 7.4.22 
me ( on i ee ATs epee) 


=1 


2(a + 3bm{) 


The critical exponent y has, as in molecular field theory, a value of y = 1. 


7.4.3 Fluctuations in the Gaussian Approximation 
7.4.3.1 Gaussian Approximation 


Next we want to investigate the influence of fluctuations of the magnetization. 
To this end, we first expand the Ginzburg-Landau functional in terms of the 
deviations from the most probable state up to second order 


m(x) = mje, + m’(x) , (7.4.23) 
where 
m/(x) = L~*/? 5° me'** (7.4.24) 
keB 


characterizes the deviation from the most probable value. Because of the 
underlying cell structure, the summation over & is restricted to the Brillouin 
zone B: -2 < ki < =. The condition that m(x) be real yields 


m,=m_,. (7.4.25) 


A)T>T® andh=0: 


In this region, m, = 0, and the Fourier series (7.4.24) diagonalizes the har- 
monic part F;, of the Ginzburg-Landau functional 
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i= pee (am"” + c(Vm’)”) — Sila + ck?)m,m_, . (7.4.26) 
k 


We can now readily calculate the partition function (7.4.6) in the Gaussian 
approximation above T°: 


Zo= Zo f [dime ; (7.4.27) 
k 


We decompose mz, into real and imaginary parts, finding for each k and each 
of the n components of m; a Gaussian integral, so that 


Za= 701] (lx / at =) (7.4.28) 


results, and thus the free energy (the stationary solution m; = 0 makes no 
contribution) is 


n T 
F(T,0) = Fo —kT = ) log ———__.. . 4.2 
( ,0) 0 2 - 0g Bla a ck?) (7 9) 
The specific heat, using )>),--- = ay z fd¢k... and Eq. (7.4.8), is then 
@F/L¢ on, of dk 1 
Ch=0 = —T =k~(Ta’ / oe 7.4.30 
ae oT” 7 (2m)? (a + ck?)? oe 


The dots stand for less singular terms. We define the quantity 


a x 
fa /z=(5 ays (7.4.31) 


which diverges in the limit T — T° and will be found to represent the cor- 
relation length in the calculation of the correlation function (7.4.47). By 
introducing gq = &k into (7.4.30) as a new integration variable, we find the 
singular part of the specific heat 


ei Ave (7.4.32) 


with the amplitude 


Aaa a), Vs, a ay 7H ; (7.4.33) 


Here, the radius A of the Brillouin sphere enters; it is introduced at the end of 
Appendix E. The amplitude A, characterizes the strength of the singularity 
above T,. Here and in the following, d-dimensional integrals of the type 
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d 
Nice eoled bor m ve 


occur, where 


Ka= / = = (2-1ner(S)) (7.4.34b) 


is the surface of a d-dimensional unit sphere divided by (27). In the further 
evaluation of (7.4.32) and (7.4.33), the three cases 
d<4, d=4, d>4 must be distinguished: 


d<A4 
AE q?} oo qt! co ae aes 
dq ——— =i ag 1, - | dqq°° = finite + O((Ag) = ) 
| (l+¢@?)? Jo (Q+@)? Jae 
—_S 
(Ag) 4-4 
d=4 


AE 3 AE 
0 (1+ 4?) q 


AE a=1 AE 
qd d—5 d—5 
dq ( q ) / dq q 
if (1+ 4)? 0 


3) ee ee 
0 (1 + q?)? d—4 


The overall result is summarized in (7.4.35): 


4—d 
A,(T—-T9)7? de4 
Chg = \ ~ log(T — T°) - d=4 (7.4.35) 
A-B(T—T)* d>4. 


For d < 4, the specific heat diverges at T.; for d > 4, it exhibits a cusp. The 
amplitude A+ for d < 4 is given by 


d 

n qa! 2 foe) qc} 

Ag Te (=) Ka | dg——,, . (7.4.36) 
a Ne 0  (1+¢@)’ 

Below d = 4, the critical exponent of the specific heat is (enzo ~ (T — Te) *) 


(4—d); (7.4.37) 


Qe 


Nl rR 
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in particular, for d = 3 in the Gaussian approximation, a = 5. Comparison 


with exact results and experiments shows that the Gaussian approximation 
overestimates the fluctuations. 


B) T < T° 
Now we turn to the region T < T° and distinguish between the longitudinal 
(m,) and the transverse components (m;) 

mix) =m, +mi(x2), mi(z)=mi(x) for i>2 (7.4.38) 


with the Fourier components m4, and m/,,, where the latter are present only 
for n > 2. In the present context, including non-integer values of d, vectors 
will be denoted by just «, etc. From (7.4.10), we find for the Ginzburg-Landau 
functional in second order in the fluctuations: 


Fria] = F [mi] 1+ E|(- Dat =~ + O(N?) + ck?) iy 


+ (s- + ek?) S~ pmie? hb 


i>2 


(7.4.39) 


To arrive at this expression, the following ancillary calculation was used: 


2 ! o 2 
a (mi +2mim,+myp + mi) 


+; (mi +4mimi, + 6mm? + amimi ) — h(mi +m}) 
b 
= =amj + =mi — hmi + (a + 3bm}) mi? + (a+ bm?) mi . 
2 Re 
ae 


Analogously to the computation leading from (7.4.26) to (7.4.29), we find for 
the free energy of the low-temperature phase at h = 0 


F(T,0) = Fo(T,h) + Fe.t.(T,0)— 


T . (7.4.40) 
— TS (lo8 7 Baal + ck) (n Dee. =} : 


The first term results from Zo; the second from F [m1], the stationary solution 
considered in the Ginzburg-Landau approximation; the third term from the 
longitudinal fluctuations; and the fourth from the transverse fluctuations. The 
latter do not contribute to the specific heat, since their energy is temperature 
independent for h = 0: 
Oe  o2 a? 4nd 
cna0 = T+ Avge t= eae AAT P) Fe (7.4.41) 


where the low-temperature correlation length 
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4 
= 2\a| = PO \ 2 a0 -1/2 0 
he ( 21) = (=<) Payee, er (7.4.42) 


is to be inserted. The amplitudes in (7.4.23) and (7.4.41) obey the relations 


é 4 ~ 
A_=-A A_ = —A,. 7.4.43 

n +> n + ( ) 

The ratio of the amplitudes of the singular contribution to the specific heat 
depends only on the number of components n and the spatial dimension d, 
and is in this sense universal. The transverse fluctuations do not contribute 
to the specific heat below T.; therefore, the factor + enters the amplitude 


. n 
ratio. 


7.4.3.2 Correlation Functions 
We now calculate the correlation functions in the Gaussian approximation. 
We start by considering T > T° . In order to calculate this type of quantity, 


with which we shall meet up repeatedly later, we introduce the generating 
functional 


1 
Zih] = d —BFr+d hem_r 
[h] 7) Mp e 


1 2 2 
_ | | —B>, (a+ck*)|mz,|7+hpm— 
7 me) k ee — 


To evaluate the Gaussian integrals in (7.4.44), we introduce the substitution 


(7.4.44) 


1 ¥: 

Tie = mx — 55 (a+ ck?) "hp , (7.4.45) 
obtaining 

Zh] =e = ba (7.4.46) 

= ex a Sa = 5 oo. 
' 4G 7 atch * —* 
Evidently, 
o2 
MeM_pK) = ————_ Zh ; 
MOOR Bigs No 


from which we find the correlation function by making use of (7.4.46): 


(M~pM_ pf") = Ok! = Ok,k! G(k) . (7.4.47) 


1 
25(a+ ck?) 
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Here, we have taken into account the fact that in the sum over k in (7.4.46), 
each term hyh_, = h_~hz occurs twice. From the last equation, the mean- 
ing of the correlation length (7.4.31) becomes clear, since in real space, 
Eq. (7.4.47) gives 


I\\ ab ik(a—2’) 1 = d¢k eik(a—2') 
(m(x)m(2")) = +4 » e 2B(a+ ck?) i (2r)* 2Bc(E-? + k?) 


7 2-4 f dq eig(a—2")/€ 
2Be Jgcng (Qn)? 1+@? 
(7.4.48) 


The correlation length is characterized by the critical exponent v = 5. For 
T = T°, one can see immediately from the second expression that 


1 


jx —a'|*-? 


(m(a)m(z"')) ~ (7.4.49) 


i.e. the exponent 7 introduced in (7.2.13) is zero in this approximation: 7 = 0. 
In three dimensions, we find from (7.4.48) the Ornstein—Zernike correlation 
function: 


(m{xe)m(x’)) = x (7.4.50) 
m(x)m(x')) = —-——  ,_ r=|x-x |. A, 
87rbce 7 
Remark: The correlation function (7.4.47) obeys 
lim G(k) = kT xr , (7.4.51) 
where yr is the isothermal susceptibility (7.4.22a). 


For T < T° we distinguish for n > 1 between the longitudinal correlation 
function and the transverse (¢ > 2) correlation function: 


Gi (k) = (mi,mi_,) and G1(k) = (miemi_n) - (7.4.52) 


For n = 1, only Gj(k) is relevant. From (7.4.39), it follows in analogy to 
(7.4.47) that 


1 h—0 1 

Gi(k) = 7.4.53 

\) = eee Eee) DORA —H+ay =A) 

and 
1 rao ol 

G.(k) = 7.4.54 

1() 23[ 2 + ck] 2Bck? as) 
eae (7.4.54) 
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The divergence of the transverse susceptibility (correlation function) 
(7.4.54a) at h = 0 is a result of rotational invariance, owing to which it 
costs no energy to rotate the magnetization. 

We first want to summarize the results of the Gaussian approximation, 
then treat the limits of its validity, and finally, in Sect. 7.4.4.1, to discuss the 
form of the correlation functions below T° in a more general way. 

In summary for the critical exponents, we have: 


d 1 1 
OFluct =2— 5,8 oi 1,6 3,V gil 0 (7.4.55) 


and for the amplitude ratios of the specific heat, the longitudinal correlation 
function and the isothermal susceptibility: 


~ C, 
sees —=1 d —=2. 7.4.56 
, and 5 (7.4.56) 


The amplitudes are defined in (7.4.32), (7.4.41), (7.4.57), and (7.4.58): 


ese Cua (7.4.57) 
“14 (Ek)? Be’ 


MSO T; TS Te (7.4.58) 


7.4.3.3 Range of Validity of the Gaussian Approximation 


The range of validity of the Gaussian approximation and of more elabo- 
rate perturbation-theoretical calculations can be estimated by comparing the 
higher orders with lower orders. For example, the fourth order must be much 
smaller than the second, or the Gaussian contribution to the specific heat 
must be smaller than the stationary value. The Ginzburg-Landau approxi- 
mation is permissible if the fluctuations are small compared to the stationary 
value, i.e. from Eqns. (7.4.16) and (7.4.41), 


Fae 
Ac > 4-4 (=) oe (7.4.59) 
Cc 
where NV is a numerical factor. Then we require that 
N 
al? 7.4.60 
T > Ac ( ) 


: T-T° 
with + = rae and £) = Pilg 


For dimensions d < 4, the Ginzburg-Landau approximation fails near 


T°. From (7.4.60), we find a characteristic temperature Tg, = (aes 
0 
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Table 7.3. The correlation length and the critical region 


Superconductors!” fo ~ 10° A TOL = 1071° — 19714 
Magnets fo ~ A TOL ~ 107? 
A— Transition fo ~ 4 A Tat ~ 0.3 


the so called Ginzburg—Levanyuk temperature; it depends on the Ginzburg— 
Landau parameters (see Table 7.3). 

In this connection, d, = 4 appears as a limiting dimension (upper criti- 
cal dimension). For d < 4, the Ginzburg-Landau approximation fails when 
T <Tg@z- It is then no longer sufficient to add the fluctuation contribution; 
instead, one has to take interactions between the fluctuations into account. 
Above four dimensions, the corrections to the Gaussian approximation on 
approaching T? become smaller, so that there, the Gaussian approximation 
applies. For d > 4, the exponent of the fluctuation contribution is negative, 


from Eq. (7.4.35): QFiuct < 0. Then the ratio can be cheo(Te) 21. 


Cc 


7.4.4 Continuous Symmetry and Phase Transitions of First Order 
7.4.4.1 Susceptibilities for T < Ty 


A) Transverse Susceptibility 


We found for the transverse correlation function(7.4.54a) that G.i(k) = 


Tago] and we now want to show that the relation G.(0) = Pm is 
a general result of rotational invariance. To this end, we imagine that an 
external field h acts on a ferromagnet. Now we investigate the influence of 


an additional infinitesimal, transverse field dh which is perpendicular to h 


bh? 
\/ (b+ bh)? = Wh? + 6h? =h+ ope tt 
Thus, the magnitude of the field is changed by only O(6h?); for a small dh, 
this is equivalent to a rotation of the field through the angle oh (Fig. 7.16). 


oh 
h m / m+m —- Fig. 7.16. The field h and the addi- 
tional, infinitesimal transverse field 6h 


” According to BCS theory, & ~ 0.18 F . In pure metals, m = me, ur = 108 mT. 


is low, 0 = 1000 — 16.000 A. The A-15 compounds NbzSn and V3Ga have flat 
bands, so that m is large, ur = 10° =, Te is higher, and 9 = 50 A. The situation 
is different in high-T, superconductors; there, £9 ~ A. 
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The magnetization rotates through the same angle; this means that om = gh 
and we obtain for the transverse susceptibility, 
6m som 
=—_=—_. 7.4.61 
XL oh i ( ) 


The transverse correlation function in the Gaussian approximation (7.4.54a) 
is in agreement with this general result. 


Remarks concerning the spatial dependence of the transverse correlation func- 
tion G(r): 


(i) 


1 f dtk elke é,\o? wee 
Gulnh=0)= a5 | oa A = 4,(%) » €1 = (28c) 


(7.4.62) 


Employing the volume element d¢ k = dkk@~!(sin 9)? d@dQy_1, the integral 
in (7.4.62) becomes 


Qa-1 [ d—1 1 ri ik 0/7: d—2 
dkk eT COS" (sin 8 do 
(27)* Jo 28ck? Jo isin 6) 


iad d Sa. kr 
= a | dewi-p (4_1) p (1) o¢att i 
28c2nr Jo 2 2 2 (kr)?~* 


d—2) 18 


~N pC 


For dimensional reasons, G',(r) must be of the form 


G(r) ~00(8), 


i.e. the transverse correlation length from Eq. (7.4.62) is 
E.= éMa2 ne ages = pv/(d—2) , (7.4.63) 


where the exponent was rearranged using the scaling relations. 
(ii) We also compute the local transverse fluctuations of the magnetization 
from 


0) de dk k@-1 2m1 | =d+2 / 2M A qa 
0 0 


oe — dq —; 
a; + ck? h Tite 


'8 1S. Gradshteyn and I.M. Ryzhik, Table of Integrals, Series, and Products (Aca- 
demic Press, New York 1980), Eq. 8.411.7 
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and consider the limit h — 0: the result is 


finite for d>2 
logh for d=2 


254 no : 
(4) 7 Sco ifm, #0 for d<2. 


For d < 2, the transverse fluctuations diverge in the limit h — 0. As a result, 
for d < 2, we must have m, = 0. 


B) Longitudinal Correlation Function 


In the Gaussian approximation, we found for T < T, in Eq. (7.4.54a) that 


1 
sath : 
Bab no CU) = Tae 


as for n = 1 . In fact, one would expect that the strong transverse fluctuations 
would modify the behavior of G\(k). Going beyond the Gaussian approxi- 
mation, we now calculate the contribution of orientation fluctuations to the 
longitudinal fluctuations. We consider a rotation of the magnetization at the 
point x and decompose the change dm into a component 6m, parallel and 
a vector 6m, perpendicular to mo (Fig. 7.17). The invariance of the length 
yields the condition 


m2 = m2 + 2modm1 + 6m? + (6m)? ; 


and it follows from this owing to |6m,| < mo that 


1 
dm, = -——(dm,)”. (7.4.64) 
2mo 
dm 
Mo Mop + 6m 


Fig. 7.17. The rotation of the spontaneous magnetiza- 
tion in isotropic systems 


For the correlation of the longitudinal fluctuations, one obtains from this 
the following relation to the transverse fluctuations: 


(6m1(x)dm1(0)) = rag (Ot. *()5m*(0) ; (7.4.65) 


We now factor this correlation function into the product of two transverse cor- 
relation functions, Eq. (7.4.54a), and obtain from it the Fourier-transformed 
longitudinal correlation function 
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—2r/fBr h d—4 4 
re ( ) ~ he? (7.4.66) 


_ de 
Gie=H= if io 


In three dimensions, we find from this for the longitudinal susceptibility 


my 


ppom = Gi\(k = 0) ~ h-2. (7.4.66’) 


In the vicinity of the critical point T., we found m ~ hs (see just after 
Eq. (7.2.4b)); in contrast to (7.4.66), this yields 


Om, 6-1 
~NY h~ ~s 
oh ' 


In isotropic systems, the longitudinal susceptibility is not just singular only 
in the critical region, but instead in the whole coexistence region for h — 0 
(cf. Fig. 7.18). This is a result of rotational invariance. 


Fig. 7.18. Singularities in 
the longitudinal susceptibility 
in systems with internal rota- 
tional symmetry, n > 2 


C) Coexistence Singularities 


The term coexistence region denotes the region of the phase diagram with 
a finite magnetization in the limiting case of h — 0. The coexistence sin- 
gularities found in (7.4.54a), (7.4.62), and (7.4.66) for isotropic systems are 
exactly valid. This can be shown as follows: for T < T°, the Ginzburg-Landau 
functional can be written in the form 


cl [ate(So (m?- aly" (Vm)? — hm — o) 


In this expression, we have inserted (7.4.38) and have combined the com- 
ponents m/(x),7 > 2 into a vector of the transverse fluctuations m’ (x) = 
(0, m5(x),...,mi,(x)). Using (7.4.18) and m4 (x) < mj, one obtains 


“7.4 The Ginzburg-Landau Theory 377 


(7.4.68) 


1\2 1 \2 ’ lal? 
+ c(Vmi‘) +c(Vm‘, ) h(my mi) or i 


The terms which are nonlinear in the transverse fluctuations are absorbed 
into the longitudinal terms by making the substitution 
m’” 
n=, a (7.4.69) 


7 2m, 


F(m] = [ate (20mm? + (Vm) 
(7.4.70) 


h 2 2 h? al? 
+ —m') +c(Vm' hm, 4 ) 

ae ee ae 8bm? 2b 
The final result for the free energy is harmonic in the variables m// and 
m’,. As a result, the transverse propagator in the coexistence region is given 
exactly by (7.4.54a). The longitudinal correlation function is 


(m}(x)m4(0))¢ = (mi (x)mi{(0)) + a7 1 (0)*m's (0)") - (7.4.71) 


In equation (7.4.70), terms of the form (V mi)? and Viiv have been 
neglected. 
The second term in (7.4.69) leads to a reduction of the order parameter 


12 
— ea), Eq. (7.4.71) gives the cumulant, i.e. the correlation function of the 
deviations from the mean value. Since (7.4.70) now contains only harmonic 
terms, the factorization of the second term in the sum in (7.4.71) is exact, as 
used in Eq. (7.4.65). One could still raise the objection to the derivation of 
(7.4.71) that a number of terms were neglected. However, using renormaliza- 
tion group theory!’, it can be shown that the anomalies of the coexistence 
region are described by a low-temperature fixed point at which mg = oc. 


This means that the result is asymptotically exact. 


7.4.4.2 First-Order Phase Transitions 


There are systems in which not only the transition from one orientation of 
the order parameter to the opposite direction is of first order, but also the 
transition at T,. This means that the order parameter jumps at 7, from 
zero to a finite value (an example is the ferroelectric transition in BaTiOs3). 
This situation can be described in the Ginzburg-Landau theory, if b < 0, 


19 1D. Lawrie, J. Phys. A14, 2489 (1981); ibid., A18, 1141 (1985); U.C. Tauber 
and F. Schwabl, Phys. Rev. B46, 3337 (1992). 
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and if a term of the form sume with v > 0 is added for stability. Then the 


Ginzburg-Landau functional takes the form 


Fez jf atefam? + (Vm) + bm + some} ‘ (7.4.72) 


where a = a’(T — T°). The free energy density is shown in Fig. 7.19 for a 
uniform order parameter. 


Fig. 7.19. The free energy density 
in the vicinity of a first-order phase 
transition at temperatures T < T°, 
TeTl?,T=al,T7T<T1,T>Nh 


For T > T;, there is only the minimum at m = 0, that is the non-ordered 
state. At T,, a second relative minimum appears, which for T < JT, finally 
becomes deeper than that at m = 0. For T < T°, the m = 0 state is unstable. 
The stationarity condition is 


(a + bm? + 35m*)m =0 , (7.4.73) 
and the condition that a minimum is present is 

10°F v 

re 3bm? 4 155m" 0. (7.4.74) 


The solutions of the stationarity condition are 
mo = 0 (7.4.75a) 


and 


b+ (8  2a\? 
2 
m= — 350) ( 52 *) (7.4.75b) 


We recall that b < 0. The nonzero solution with the minus sign corresponds 
to a maximum in the free energy and will be left out of further consideration. 
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The minimum (7.4.75b) exists for all temperatures for which the discriminant 
is positive, i.e. below the temperature 7} 


0, 2 
T, = T : 
: a 6va! 
T, is the superheating temperature (see Fig. 7.19 and below). The transition 
temperature T, is found from the condition that the free energy for (7.4.75b) 
is zero. At this temperature (see Fig. 7.19), the free energy has a double zero 
at m? = mé and thus has the form 


(7.4.76) 


b 
5 (m? = mi) m? = (a + =m? + =m) m? 


2 2 
2 2 
Uf» b ) b 2 
=> —} —— =0. 
(5 (m ron sta) 
It follows from this that a= & and m? = —£, which both lead to 
b2 
T= Tl + (7.4.77) 
8va’ 


For T < T°, there is a local maximum at m = 0. T° plays the role of a 
supercooling temperature. In the range T? < T < T,, both phases can thus 
coexist, i.e. the supercooling or superheating of a phase is possible. Since for 
T° < T < T,, the non-ordered phase (mp = 0) is metastable; for T; > T > Ty, 
in contrast, the ordered phase (mo 4 0) is metastable. On slow cooling, so 
that the system attains the state of lowest free energy, mo jumps at T, from 
0 to 


b pope \ 1? b 
2 ape = = —— 7.4.78 
mo(Te) 3u a (sa at) Qu’ ( ) 
and, below T,, it has the temperature dependence (Fig. 7.20) 
2 3 (P= 2) 
2 = 2 A 1 1 Cc 
mo 


Fig. 7.20. The temperature depen- 
T. T dence of the magnetization in a first- 
e order phase transition 
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*7.4.5 The Momentum-Shell Renormalization Group 


The RG theory can also be carried out in the framework of the G—L func- 
tional, with the following advantages compared to discrete spin models: the 
method is also practicable in higher dimensions, and various interactions and 
symmetries can be treated. One employs an expansion of the critical expo- 
nents in e = 4—d. Here, we cannot go into the details of the necessary 
perturbation-theoretical techniques, but rather just show the essential struc- 
ture of the renormalization group recursion relations and their consequences. 
For the detailed calculation, the reader is referred to more extensive descrip- 


tions?°»?! and to the literature at the end of this chapter. 


7.4.5.1 Wilson’s RG Scheme 


We now turn to the renormalization group transformation for the Ginzburg— 
Landau functional (7.4.10). In order to introduce the notation which is usual 
in this context, we carry out the substitutions 


1 
m=—¢, a=rc, b=uc? and h— V2ch, 7.4.79 
Je? Vv ( ) 


and obtain the so called Landau-—Ginzburg—Wilson functional: 


F(d| = jf ee[se + (62)? 4 (Vo)? hd| . (7.4.80) 
An intuitively appealing method of proceeding was proposed by K.G. Wil- 
son??.?!_ Essentially, the trace over the degrees of freedom with large k in 
momentum space is evaluated, and one thereby obtains recursion relations 
for the Ginzburg-Landau coefficients. Since it is to be expected that the 
detailed form of the short-wavelength fluctuations is not of great importance, 
the Brillouin zone can be approximated as simply a d-dimensional sphere of 
radius (cutoff) A, Fig. 7.21. 


Fig. 7.21. The momentum-space RG: the par- 
tial trace is performed over the Fourier com- 
ponents @, with momenta within the shell 


<_— A > A/b <|k| <A 


20 Wilson, K.G. and Kogut, J., Phys. Rep. 12 C, 76 (1974). 
21'S. Ma, Modern Theory of Critical Phenomena, Benjamin, Reading, 1976. 
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The momentum-shell RG transformation then consists of the following 
steps: 


(i) Evaluating the trace over all the Fourier components ¢, with 
A/b < |k| < A (Fig. 7.21) eliminates these short-wavelength modes. 


(ii) By means of a scale transformation?” 


k! = bk, (7.4.81) 

g =b¢, (7.4.82) 
and therefore 

dy = 8“ 6, , (7.4.83) 


the resulting effective Hamiltonian functional can be brought into a form re- 
sembling the original model, whereby effective scale-dependent coupling pa- 
rameters are defined. Repeated application of this RG transformation (which 
represents a semigroup, since it has no inverse element) discloses the pre- 
sumably universal properties of the long-wavelength regime. As in the real- 
space renormalization group transformation of Sect. 7.3.3, the fixed points 
of the transformation correspond to the various thermodynamic phases and 
the phase transitions between them. The eigenvalues of the linearized flow 
equations in the vicinity of the critical fixed point finally yield the critical 
exponents (see (7.3.41a,b,c)). Although a perturbational expansion (in terms 
of u) is in no way justifiable in the critical region, it is completely legitimate 
at some distance from the critical point, where the fluctuations are negli- 
gible. The important observation is now that the RG flow connects these 
quite different regions, so that the results of the perturbation expansion in 
the non-critical region can be transported to the vicinity of T., whereby the 
non-analytic singularities are consistently, controllably, and reliably taken 
into account by this mapping. Perturbation-theoretical methods can likewise 
be applied in the elimination of the short-wavelength degrees of freedom 


(step (i). 


7.4.5.2 Gaussian Model 


We will now apply the concept described in the preceding section first to the 
Gaussian model, where u = 0 (see Sect. 7.4.3), 


2 Tf one considers (7.4.83) together with the field term in the Ginzburg-Landau 
functional (7.4.10), then it can be seen that the exponent ¢ determines the 
transformation of the external field and is related to yn from Sect. (7.3.4) via 
¢ =d— Yh- 
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2 
Ce el (7.4.84) 
|k|<A 
with f,, = { d¢k/(27)%. Since (7.4.84) is diagonal in the Fourier modes, the 
elimination of the components with large k merely produces a constant con- 
tribution (independent of @); the form of the effective Hamiltonian functional 
remains unchanged, provided that 
d—2 


(=F ien=0, (7.4.85) 


and r transforms as 


rf =r. (7.4.86) 


The fixed points of Eq. (7.4.86) are r* = +oo, corresponding to the high- and 
low-temperature phases, and the critical fixed point r* = 0. The eigenvalue 
for the relevant temperature-direction at this critical fixed point is clearly 
Yr = 2, and therefore one obtains the same exponent v = 1/2 from (7.3.41c,d) 
as in the molecular field theory or the Gaussian approximation (Sect. 7.4.3). 


7.4.5.3 Perturbation Theory and the e-Expansion 


The nonlinear interaction term in Eq. (7.4.80) 
u 
Fint|Px] = 3 ff Pike, Ph Pkg P—ky —ko—ks (7.4.87) 
|ki|<A 


can now be treated using perturbation theory, by expanding the exponential 
function in Eq. (7.4.6) in terms of wu. If one separates the field variables into 
their parts in the inner and the outer momentum shell, 


Pp = Pre t+ Pry ; 


with |ke| < A/b and A/b < |ks| < A, the result to first order in u includes 
terms of the following (symbolically written) form (from now on, we set kT’ 
equal to 1): 


(i) uf dLe-7%° must merely be re-exponentiated, since these degrees of free- 
dom are not eliminated; 

(ii) all terms with an uneven number of ¢< or @s, such as for example 
uf ¢.¢se-7°, vanish; 

(iii) uf ¢4e~7° makes a constant contribution to the free energy and finally 
to uf ¢2¢%e-7°, for which the Gaussian integral over the $5 can be 
carried out with the aid of Eq. (7.4.47) for the propagator 
(Oe, ob? Ke, )0 = ates: an average value which is calculated with the 
statistical weight e~7°. 
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Quite generally, Wick’s theorem?°:?! states that expressions of the form 


(I a) = (bh >Pke> : + Phen >) 
a 0 


factorize into a sum of products of all possible pairs (¢x, 6-4, )o if m is even, 
and otherwise they yield zero. Especially in the treatment of higher orders of 
perturbation theory, the Feynman diagrams offer a very helpful representa- 
tion of the large number of contributions which have to be summed in the per- 
turbation expansion. In these diagrams, lines symbolize the propagators and 
interaction vertices stand for the nonlinear coupling u. With these means at 
our disposal, we can compute the two-point function (@,_ @_,_) and the sim- 
ilarly defined four-point function.Using Eq. (7.4.47), one then obtains in the 
first non-trivial order (“1-loop”, a notation which derives from the graphical 
representation) the following recursion relation between the initial coefficients 
r,u and the transformed coefficients r’, u’ of the Ginzburg—Landau—Wilson 
functional???!; 


rf =0(r+(n+2) A(r)u) , (7.4.88) 


u = b*4u(1 — (n +8) C(r)u) , (7.4.89) 


where A(r) and C(r) refer to the integrals 


A 


A(r) = Ka [wor + k?)dk 
=K,4[A*?(1 — 8?-*) /(d — 2) — rA*4(1 — 64%) /(d — 4)] + O(r?) 
A 
C(r) = Ka [wow + k*)?| dk 


= KgAt4(1 — b*-4)/(d- 4) + O(r) , 


with Ky = 1/2¢-!n4/?T'(d/2), and the factors depending on the number n 
of components of the order parameter field result from the combinatorial 
analysis in counting the equivalent possibilities for “contracting” the fields 
gx, i.e. for evaluating the integrals over the large momenta. We note that 
here again, Eq. (7.4.85) applies. 

Linearizing equations (7.4.88) and (7.4.89) at the Gaussian fixed point 
r* = 0, u* = 0, one immediately finds the eigenvalues y, = 2 and y, = 4—d. 
Then for d > d. = 4, the nonlinearity «x u is seen to be irrelevant, and the 
mean field exponents are valid, as already surmised in Sect. 7.4.4. For d < 4 
(d. = 4 is the upper critical dimension), the fluctuations however become 
relevant and each initial value u # 0 increases under the renormalization 
group transformation. In order to obtain the scaling behavior in this case, 
we must therefore search for a finite, non-trivial fixed point. This can be 
most easily done by introducing a differential flow, with b’ = e% and 6 — 0, 
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making the number of RG steps effectively into a continuous variable, and 
studying the resulting differential recursion relations: 


7 = 2r(Q) + (n+ 2)u()Kad® — (n+ 2)r(u()Kad™™ , (7.4.90) 
mt) = (4—d)u(£) — (n+ 8)u(t)?KaAe . (7.4.91) 


Now, a fixed point is defined by the condition dr/dé = 0 = du/dé. 


Fig. 7.22. Flow of the effective coupling 

u u(€), determined by the right-hand side of 
Eq. (7.4.91), which is plotted here as the 
ordinate. Both for initial values uo > uz 
and 0 < uo < us, one finds u(@) > uz for 
Lc 


Ue 


Figure 7.22 shows the flow of u(é) corresponding to Eq. (7.4.91); for any 
initial value uo 4 0, one finds that asymptotically, i.e. for 2 > oo, the non- 
trivial fixed point 


€ 
n+8 


AS 


u.Ka = , €=4-d (7.4.92) 
is approached; this should determine the universal critical properties of the 
model. As in the real-space renormalization in Sect. 7.3, the RG transforma- 
tion via momentum-shell elimination generates new interactions; for example, 
terms « o° and V2¢', etc., which again influence the recursion relations for 
r and u in the succeeding steps. It turns out, however, that up to order e°, 
these terms do not have to be taken into account.???4 
The original assumption that u should be small, which justified the per- 
turbation expansion, now means in light of Eq. (7.4.92) that the effective 
expansion parameter here is the deviation from the upper critical dimension, 
e. If one inserts (7.4.92) into Eq. (7.4.90) and includes terms up to O(e), the 
result is 
n+2 


ete! * d—-2 __ 
t= 5 u, Kg A * = — 


(n+2)e 15 
2(n + 8) 


(7.4.93) 


The physical interpretation of this result is that fluctuations lead to a lowering 
of the transition temperature. With tT = r — rz, the differential form of the 
flow equation 


dr(é) 
de 


= 7(0) (2 —(n+2)uKa a) (7.4.94) 
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finally yields the eigenvalue y, = 2 — (n+ 2)e/(n + 8) in the vicinity of the 
critical point (7.4.92). In the one-loop order which we have described here, 
O(e), one therefore finds for the critical exponent v from Eq. (7.3.41le) 


i n+2 2 
v= 5 + An +8) e+ O(e7). (7.4.95) 


Using the result 1 = O(c?) and the scaling relations (7.3.41a-d), one ob- 
tains the following expressions (the difference from the result (7.4.35) of the 
Gaussian approximation is remarkable) 


a= n+ 8) e+ Ole?) , (7.4.96) 
1 3 , 

b= 5 t Oe), (7.4.97) 
n+2 9 

y=1+ Marat ys (7.4.98) 

6=3+6€4+ O(c’) (7.4.99) 


to first order in the expansion parameter « = 4 — d. The first non-trivial 
contribution to the exponent 7 appears in the two-loop order, 


n+2 


dn + 82 e* + O(c). (7.4.100) 


n= 


The universality of these results manifests itself in the fact that they depend 
only on the spatial dimension d and the number of components n of the 
order parameter, and not on the original “microscopic” Ginzburg-Landau 
parameters. 


Remarks: 


(i) At the upper critical dimension, d. = 4, an inverse power law is obtained 
as the solution of Eq. (7.4.91) instead of an exponential behavior, leading 
to logarithmic corrections to the mean-field exponents. 

(ii) We also mention that for long-range interactions which exhibit power- 
law behavior « |a|~¢+?), the critical exponents contain an additional 
dependence on the parameter co. 

(iii) In addition to the e-expansion, an expansion in terms of powers of 1/n is 
also possible. Here, the limit n — oo corresponds to the exactly solvable 
spherical model.?? This 1/n-expansion indeed helps to clarify some gen- 
eral aspects but its numerical accuracy is not very great, since precisely 
the small values of n are of practical interest. 


3 Shane-Keng Ma, Modern Theory of Critical Phenomena, Benjamin, Reading, 
1976. 
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The differential recursion relations of the form (7.4.90) and (7.4.91) also serve 
as a basis for the treatment of more subtle issues such as the calculation of 
the scaling functions or the treatment of crossover phenomena within the 
framework of the RG theory. Thus, for example, an anisotropic perturbation 
in the n-component Heisenberg model favoring m directions leads to a cross- 
over from the O(n)-Heisenberg fixed point?* to the O(m) fixed point.?° The 
instability of the former is described by the crossover exponent. For small 
anisotropic disturbances, to be sure, the flow of the RG trajectory passes very 
close to the unstable fixed point. This means that one finds the behavior of 
an n-component system far from the transition temperature T., before the 
system is finally dominated by the anisotropic critical behavior. 

The crossover from one RG fixed point to another can be represented 
(and measured) by the introduction of effective exponents. These are defined 
as logarithmic derivatives of suitable physical quantities. Other important 
perturbations which were treated within the RG theory are, on the one hand, 
cubic terms. They reflect the underlying crystal structure and contribute 
terms of fourth order in the cartesian components of @ to the Ginzburg— 
Landau-—Wilson functional. On the other hand, dipolar interactions lead to a 
perturbation which alters the harmonic part of the theory. 


7.4.5.4 More Advanced Field-Theoretical Methods 


If one wishes to discuss perturbation theory in orders higher than the first 
or second, Wilson’s momentum-shell renormalization scheme is not the best 
choice for practical calculations, in spite of its intuitively appealing properties. 
The technical reason for this is that the integrals in Fourier space involve 
nested momenta, which owing to the finite cutoff wavelength A are difficult 
to evaluate. It is then preferable to use a field-theoretical renormalization 
scheme with A — co. However, this leads to additional ultraviolet (UV) 
divergences of the integrals for d > d-. At the critical dimension d,., both 
ultraviolet and infrared (IR) singularities occur in combination in logarithmic 
form, [x log(A?/r)]. The idea is now to treat the UV divergences with the 
methods originally developed in quantum field theory and thus to arrive at the 
correct scaling behavior for the IR limit. In the formal implementation, one 
takes advantage of the fact that the original unrenormalized theory does not 
depend on the arbitrarily chosen renormalization point; as a consequence, one 
obtains the Callan—Symanzik- or RG equations. These are partial differential 
equations which correspond to the differential flow equations in the Wilson 
scheme. 


ae O(n) indicates invariance with respect to rotations in n-dimensional space, 
i.e. with respect to the group O(n). 

25 See D.J. Amit, Field Theory, the Renormalization Group and Critical Phenom- 
ena, 2nd ed., World Scientific, Singapore, 1984, Chap. 5-3. 
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e-expansions have been carried out up to the seventh order;”° the series 
obtained is however only asymptotically convergent (the convergence radius 
of the perturbation expansion in u clearly must be zero, since u < 0 cor- 
responds to an unstable theory). The combination of the results from ex- 
pansions to such a high order with the divergent asymptotic behavior and 
Borel resummation techniques yields critical exponents with an impressive 
precision; cf. Table 7.4. 


Table 7.4. The best estimates for the static critical exponents v, 3, and 6, for the 
O(n)-symmetric ¢* model in d = 2 and d = 3 dimensions, from ¢-expansions up 
to high order in connection with Borel summation techniques.2° For comparison, 
the exact Onsager results for the 2d-Ising model are also shown. The limiting case 
n = 0 describes the statistical mechanics of polymers. 


Y y B n 


d=2 n=0 1.39+ 0.04 0.76 + 0.03 0.065+ 0.015 0.21 +0.05 
n=1 1.73+0.06 0.99 + 0.04 0.12040.015 0.26 + 0.05 
2d Ising (exact) 1.75 1 0.125 0.25 


d=3 n=0 1.160 + 0.004 0.5885 + 0.0025 0.3025 + 0.0025 0.031 + 0.003 
n=1 1.239 + 0.004 0.6305 + 0.0025 0.3265 + 0.0025 0.037 + 0.003 
n=2 1.3815+0.007 0.67140.005 0.3485 + 0.0035 0.040 + 0.003 
n=3 1.3890+0.010 0.710+0.007 0.368+0.004 0.040 + 0.003 


*7.5 Percolation 


Scaling theories and renormalization group theories also play an important 
role in other branches of physics, whenever the characteristic length tends to 
infinity and structures occur on every length scale. Examples are percolation 
in the vicinity of the percolation threshold, polymers in the limit of a large 
number of monomers, the self-avoiding random walk, growth processes, and 
driven dissipative systems in the limit of slow growth rates (self-organized 
criticality). As an example of such a system which can be described in the 
language of critical phenomena, we will consider percolation. 


7.5.1 The Phenomenon of Percolation 


The phenomenon of percolation refers to problems of the following type: 


(i) Consider a landscape with hills and valleys, which gradually fills up with 
water. When the water level is low, lakes are formed; as the level rises, some of 


6 J.C. Le Guillou and J.C. Zinn-Justin, J. Phys. Lett. 46 L, 137 (1985) 
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the lakes join together until finally at a certain critical level (or critical area) 
of the water, a sea is formed which stretches from one end of the landscape 
to the other, with islands. 

(ii) Consider a surface made of an electrical conductor in which circular holes 
are punched in a completely random arrangement (Fig. 7.23a). Denoting the 
fraction of remaining conductor area by p, we find for p > p, that there 
is still an electrical connection from one end of the surface to the other, 
while for p < pe, the pieces of conducting area are reduced to islands and no 
longer form continuous bridges, so that the conductivity of this disordered 
medium is zero. One refers to p, as the percolation threshold. Above pe, there 
is an infinite “cluster”; below this limit, there are only finite clusters, whose 
average radius however diverges on approaching p.. Examples (i) and (ii) 
represent continuum percolation. Theoretically, one can model such systems 
on a discrete d-dimensional lattice. In fact, such discrete models also occur 
in Nature, e.g. in alloys. 


a) c) 


Fig. 7.23. Examples of percolation (a) A perforated conductor (Swiss cheese 
model): continuum percolation; (b) site percolation; (c) bond percolation 


(iii) Let us imagine a square lattice in which each site is occupied with a 
probability p and is unoccupied with the probability (1 — p). ‘Occupied’ can 
mean in this case that an electrical conductor is placed there and ‘unoccu- 
pied’ implies an insulator, or that a magnetic ion or a nonmagnetic ion is 
present, cf. Fig. 7.23b. Staying with the first interpretation, we find the fol- 
lowing situation: for small p, the conductors form only small islands (electric 
current can flow only between neighboring sites) and the overall system is an 
insulator. As p increases, the islands (clusters) of conducting sites get larger. 
Two lattice sites belong to the same cluster when there is a connection be- 
tween them via occupied nearest neighbors. For large p (p < 1) there are 
many conducting paths between the opposite edges and the system is a good 
conductor. At an intermediate concentration p,, the percolation threshold or 
critical concentration, a connection is just formed, i.e. current can percolate 
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from one edge of the lattice to the other. The critical concentration separates 
the insulating phase below p, from the conducting phase above pe. 


In the case of the magnetic example, at p, a ferromagnet is formed from 
a paramagnet, presuming that the temperature is sufficiently low. A further 
example is the occupation of the lattice sites by superconductors or normal 
conductors, in which case a transition from the normal conducting to the 
superconducting state takes place. 

We have considered here some examples of site percolation, in which the 
lattice sites are stochastically occupied, Fig. 7.23b. Another possibility is that 
bonds between the lattice sites are stochastically present or are broken. One 
then refers to bond percolation (cf. Fig. 7.23c). Here, clusters made up of 
existing bonds occur; two bonds belong to the same cluster if there is a con- 
nection between them via existing bonds. Two examples of bond percolation 
are: (i) a macroscopic system with percolation properties can be produced 
from a stochastic network of resistors and connecting wires; (ii) a lattice of 
branched monomers can form bonds between individual monomers with a 
probability p. For p < pe, finite macromolecules are formed, and for p > pe, 
a network of chemical bonds extends over the entire lattice. This gelation 
process from a solution to a gel state is called the sol-gel transition (example: 
cooking or “denaturing” of an egg or a pudding); see Fig. 7.23. 


Remarks: 


(i) Questions related to percolation are also of importance outside physics, e.g. in 
biology. An example is the spread of an epidemic or a forest fire. An affected 
individual can infect a still-healthy neighbor within a given time step, with a 
probability p. The individual dies after one time step, but the infected neigh- 
bors could transmit the disease to other still living, healthy neighbors. Below 
the critical probability p., the epidemic dies out after a certain number of time 
steps; above this probability, it spreads further and further. In the case of a 
forest fire, one can think of a lattice which is occupied by trees with a prob- 
ability p. When a tree burns, it ignites the neighboring trees within one time 
step and is itself reduced to ashes. For small values of p, the fire dies out after 
several time steps. For p > pc, the fire spreads over the entire forest region, as- 
suming that all the trees along one boundary were ignited. The remains consist 
of burned-out trees, empty lattice sites, and trees which were separated from 
their surroundings by a ring of empty sites so that they were never ignited. For 
p > Pc, the burned-out trees form an infinite cluster. 

(ii) In Nature, disordered systems often occur. Percolation is a simple example 
of this, in which the occupation of the individual lattice sites is uncorrelated 
among the sites. 


As emphasized above, these models for percolation can also be introduced 
on a d-dimensional lattice. The higher the spatial dimension, the more possi- 
ble connected paths there are between sites; therefore, the percolation thresh- 
old p, decreases with increasing spatial dimension. The percolation threshold 
is also smaller for bond percolation than for site percolation, since a bond 
has more neighboring bonds than a lattice site has neighboring lattice sites 
(in a square lattice, 6 instead of 4). See Table 7.5. 
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Table 7.5. Percolation thresholds and critical exponents for some lattices 


Lattice De B v y 
site bond 
one-dimensional 1 1 = 1 1 
square 0.592 1/2 a 4 8 
simple cubic 0.311 0.248 0.417 0.875 1.795 
Bethe lattice st ay 1 1 1 
d=6hypercubic 0.107 0.0942 1 $ 1 
d=7hypercubic 0.089 0.0787 1 $ 1 


The percolation transition, in contrast to thermal phase transitions, has 
a geometric nature. When p increases towards pe, the clusters become larger 
and larger; at p., an infinite cluster is formed. Although this cluster already 
extends over the entire area, the fraction of sites which it contains is still 
zero at pe. For p > pe, more and more sites join the infinite cluster at the 
expense of the finite clusters, whose average radii decrease. For p = 1, all 
sites naturally belong to the infinite cluster. The behavior in the vicinity 
of pe exhibits many similarities to critical behavior in second-order phase 
transitions in the neighborhood of the critical temperature T.. As discussed 
in Sect. 7.1, the magnetization increases below T, as M ~ (T, — T)”. In the 
case of percolation, the quantity corresponding to the order parameter is the 
probability Ps, that an occupied site (or an existing bond) belongs to the 
infinite cluster, Fig. (7.24). Accordingly, 


0 f c 
Pye X Ree (7.5.1) 
(p x De) for p> pe. 


Fig. 7.24. Ps: order parameter (the 
strength of the infinite clusters); S: aver- 
1—pe 1—p age number of sites in a finite cluster 


The correlation length € characterizes the linear dimension of the finite clus- 
ters (above and below p,). More precisely, it is defined as the average distance 
between two occupied lattice sites in the same finite cluster. In the vicinity 
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of pe, € behaves as 
E~|p—pel ”. (7.5.2) 


A further variable is the average number of sites (bonds) in a finite cluster. 
It diverges as 


S~|p—pel 7 (7.5.3) 


and corresponds to the magnetic susceptibility .; cf. Fig. (7.24). 

Just as in a thermal phase transition, one expects that the critical prop- 
erties (e.g. the values of 3,v,7) are universal, i.e. that they do not depend on 
the lattice structure or the kind of percolation (site, bond, continuum percola- 
tion). These critical properties do, however, depend on the spatial dimension 
of the system. The values of the exponents are collected in Table 7.5 for several 
different lattices. One can map the percolation problem onto an s-state-Potts 
model, whereby the limit s — 1 is to be taken.?”°?8 From this relation, it is 
understandable that the upper critical dimension for percolation is d. = 6. 
The Potts model in its field-theoretical Ginzburg-Landau formulation con- 
tains a term of the form ¢°; from it, following considerations analogous to the 
¢* theory, the characteristic dimension d, = 6 is derived. The critical expo- 
nents 3,v,y describe the geometric properties of the percolation transition. 
Furthermore, there are also dynamic exponents, which describe the transport 
properties such as the electrical conductivity of the perforated circuit board 
or of the disordered resistance network. Also the magnetic thermodynamic 
transitions in the vicinity of the percolation threshold can be investigated. 


7.5.2 Theoretical Description of Percolation 


We consider clusters of size s, i.e. clusters containing s sites. We denote the 
number of such s-clusters divided by the number of all lattice sites by nz, 
and call this the (normalized) cluster number. Then sn, is the probability 
that an arbitrarily chosen site will belong to a cluster of size s. Below the 
percolation threshold (p < p.), we have 


(7.5.4) 


oS number of all the occupied sites 
y SNs = z =. = 
a total number of lattice sites 


The number of clusters per lattice site, irrespective of their size, is 
Se (7.5.5) 
s 


2” C.M. Fortuin and P. W. Kasteleyn, Physica 57, 536 (1972). 

28 The s-state-Potts model is defined as a generalization of the Ising model, which 
corresponds to the 2-state-Potts model: at each lattice site there are s states Z. 
The energy contribution of a pair is —Jéz,z’, i.e. —J if both lattice sites are in 
the same state, and otherwise zero. 
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The average size (and also the average mass) of all finite clusters is 


S= Sus = — 87 ng. (7.5.6) 


s=1 


The following relation holds between the quantity P,. defined before (7.5.1) 
and n,: we consider an arbitrary lattice site. It is either empty or occupied 
and belongs to a cluster of finite size, or it is occupied and belongs to the 
infinite cluster, that is 1 = 1—p+ 30%, 8n,+p Px, and therefore 


1 
Pyo =1-—- 51s. laDuke 


7.5.3 Percolation in One Dimension 


We consider a one-dimensional chain in which every lattice site is occupied 
with the probability p. Since a single unoccupied site will interrupt the con- 
nection to the other end, i.e. an infinite cluster can be present only when all 
sites are occupied, we have p, = 1. In this model we can thus study only the 
phase p < pe. 

We can immediately compute the normalized number of clusters n, for 
this model. The probability that an arbitrarily chosen site belongs to a clus- 
ter of size s has the value sp*(1 —p)’, since a series of s sites must be 
occupied (factor p*) and the sites at the left and right boundaries must be 
unoccupied (factor (1 —p)*). Since the chosen site could be at any of the 
s locations within the clusters, the factor s occurs. From this and from the 
general considerations at the beginning of Sect. 7.5.2, it follows that: 


ns =p’ (1—p). (7.5.8) 


With this expression and starting from (7.5.6), we can calculate the average 
cluster size: 


1 ie tp) Pye 
Sa sns= 5 DSP ae P) (o=) Sov 


Pp 

; : ae (7.5.9) 
var ( =) | ee Saal ae 

Lee. - Leap 


P<DPc- 


The average cluster size diverges on approaching the percolation threshold 
pe = 1as1/(1—p), ie. in one dimension, the exponent introduced in (7.5.3) 
isy=1. 

We now define the radial correlation function g(r). Let the zero point be 
an occupied site; then g(r) gives the average number of occupied sites at a 
distance r which belong to the same cluster as the zero point. This is also 
equal to the probability that a particular site at the distance r is occupied and 
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belongs to the same cluster, multiplied by the number of sites at a distance 
r. Clearly, g(0) = 1. For a point to belong to the cluster requires that this 
point itself and all points lying between 0 and r be occupied, that is, the 
probability that the point r is occupied and belongs to the same cluster as 0 
is p’, and therefore we find 


g(r) =2p" for r>1. (7.5.10) 


The factor of 2 is required because in a one-dimensional lattice there are two 
points at a distance r. 
The correlation length is defined by 

Wear). oOo ae 


SS Se ae) Sar vey) 


Analogously to the calculation in Eq. (7.5.9), one obtains 


l1+p _ 1l1+p 
2 2 
(l—p)  (p—Pe) 
i.e. here, the critical exponent of the correlation length is y = 1. We can also 
write g(r) in the form 


f= ; (7.511) 


g(r) =e"? = 20°F, (7.5.10’) 


where after the last equals sign, we have taken p ® pe, so that log p = 
log(1 — (1 — p)) = —(1 — p). The correlation length characterizes the (expo- 
nential) decay of the correlation function. 

The average cluster size previously introduced can also be represented in 
terms of the radial correlation function 


S=1+ = g(r) . (7.5.12) 


We recall the analogous relation between the static susceptibility and the 
correlation function, which was derived in the chapter on ferromagnetism, 
Eq. (6.5.42). One can readily convince oneself that (7.5.12) together with 
(7.5.10) again leads to (7.5.9). 


7.5.4 The Bethe Lattice (Cayley Tree) 


A further exactly solvable model, which has the advantage over the one- 
dimensional model that it is defined also in the phase region p > peg, is 
percolation on a Bethe lattice. The Bethe lattice is constructed as follows: 
from the lattice site at the origin, z (coordination number) branches spread 
out, at whose ends again lattice sites are located, from each of which again 
z — 1 new branches emerge, etc. (see Fig. 7.25 for z = 3). 


394 7. Phase Transitions, Renormalization Group Theory, and Percolation 


Fig. 7.25. A Bethe lattice with the coordination 
number z = 3 


The first shell of lattice sites contains z sites, the second shell contains 
z(z — 1) sites, and the Ith shell contains 2(z—1)'~' sites. The number of 
lattice sites increases exponentially with the distance from the center point 
~ e!los(2—1) while in a d-dimensional Euclidean lattice, this number increases 
as 17-1, This suggests that the critical exponents of the Bethe lattice would be 
the same as those of a usual Euclidean lattice for d — oo. Another particular 
difference between the Bethe lattice and Euclidean lattices is the property 
that it contains only branches but no closed loops. This is the reason for its 
exact solvability. 

To start with, we calculate the radial correlation function g(l), which 
as before is defined as the average number of occupied lattice sites within 
the same cluster at a distance / from an arbitrary occupied lattice site. The 
probability that a particular lattice site at the distance / is occupied as well 
as all those between it and the origin has the value p!. The number of all the 
sites in the shell 1 is z(z — 1)'~'; from this it follows that: 

z 


g(t) = 2(z -1)1 p! = (p(z — 1)! = —~<elloe@-)) | (7.5.13) 


z—1 z—1 


From the behavior of the correlation function for large 1, one can read off 
the percolation threshold for the Bethe lattice. For p(z — 1) < 1, there is 
an exponential decrease, and for p(z — 1) > 1, g(l) diverges for | > oo and 
there is an infinite cluster, which must not be included in calculating the 
correlation function of the finite clusters. It follows from (7.5.13) for p, that 


1 
= 5.14 
ae (7.5.14) 


For z = 2, the Bethe lattice becomes a one-dimensional chain, and thus 
De = 1. From (7.5.13) it is evident that the correlation length is 


-1 ai! 1 
x ey ~N 
log[p(z—1)] log = pe-p 


é (7.5.15) 
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for p in the vicinity of p,, i.e. v = 1, as in one dimension’. The same result 
is found if one defines € by means of (7.5.11). For the average cluster size one 
finds for p < pe 


(1 
=1+ Yat yee) for p< pe; (7.5.16) 


Len y= 1. 

The strength of the infinite cluster P.., i.e. the probability that an arbi- 
trary occupied lattice site belongs to the infinite cluster, can be calculated in 
the following manner: the product pP, is the probability that the origin or 
some other point is occupied and that a connection between occupied sites 
up to infinity exists. We first compute the probability Q that an arbitrary 
site is not connected to infinity via a particular branch originating from it. 
This is equal to the probability that the site at the end of the branch is not 
occupied, that is (1 — p) plus the probability that this site is occupied but 
that none of the z — 1 branches which lead out from it connects to ov, i.e. 


Q=1-pt+pQ*'. 


This is a determining equation for Q, which we shall solve for simplicity for 
a coordination number z = 3. The two solutions of the quadratic equation 
are @Q=1andQ= ae 

The probability that the origin is occupied, that however no path leads 
to infinity, is on the one hand p(1— P,,) and on the other p Q”, i.e. for z = 3: 


Pi, =1-Q?. 


For the first solution, Q = 1, we obtain P,, = 0, obviously relevant for p < pe; 
and for the second solution 


Py, =1- (<2) (7.5.17) 


Pp 


for p > pe. In the vicinity of p. = 5, the strength of the infinite clusters varies 
as 


that is G = 1. We will also obtain this result with Eq. (7.5.30) in a different 
manner. 


29 Earlier, it was speculated that hypercubic lattices of high spatial dimension have 
the same critical exponents as the Bethe lattice. The visible difference in v seen 
in Table 7.5 is due to the fact that in the Bethe lattice, the topological (chemical) 
and in the hypercubic lattice the Euclidean distance was used. If one uses the 
chemical distance for the hypercubic lattice also, above d = 6, v = 1 is likewise 
obtained. See Literature: A. Bunde and S. Havlin, p. 71. 
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Now we will investigate the normalized cluster number n,, which is also 
equal to the probability that a particular site belongs to a cluster of size s, 
divided by s. In one dimension, n; could readily be determined. In general, 
the probability for a cluster with s sites and t (empty) boundary points is 
pr(l— p)’. The perimeter t includes external and internal boundary points 
of the cluster. For general lattices, such as e.g. the square lattice, there are 
various values of t belonging to one and the same value of s, depending on 
the shape of the cluster; the more stretched out the cluster, the larger is t, 
and the more nearly spherical the cluster, the smaller is t. In a square lattice, 
there are two clusters having the size 3, a linear and a bent cluster. The 
associated values of t are 8 and 7, and the number of orientations on the 
lattice are 2 and 4. For general lattices, the quantity g,, must therefore be 
introduced; it gives the number of clusters of size s and boundary t. Then 
the general expression for ng is 


ns = )- gst p*(1—p)' . (7.5.19) 
t 


For arbitrary lattices, a determination of gs: is in general not possible. For 
the Bethe lattice, there is however a unique connection between the size s of 
the cluster and the number of its boundary points t. A cluster of size 1 has 
t = z, and acluster of s = 2 has t = 2z— 2. In general, a cluster of size s has 
z—2 more boundary points than a cluster of size s — 1, i.e. 


t(s) =z+(s—1)(z-—2) =24 s(z-2). 
Thus, for the Bethe lattice, 
s 2+(z—-2)s 
ns = 9s p*(L—p) (7.5.20) 
where gs is the number of configurations of clusters of the size s. In order to 


avoid the calculation of gs, we will refer ns(p) to the distribution n,(p-) at 
Pe- 

We now wish to investigate the behavior of n,; in the vicinity of pp. = 
(z—1)7' as a function of the cluster size, and separate off the distribution 
at De; 


Ss 


nate) =reloo)[2—P] [2(E=PY] (7.5.21 


we then expand around p = pe 


no) = mon [S=E] [tgs +l 29) 


= Ns(De) e- ’ 


s 


(7.5.22) 


with c= log(1 wt) x (p Pe)” : 


This means that the number of clusters of size s decreases exponentially. 
1 
The second factor in (7.5.22) depends only on the combination (p — pe), 
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with o = 1/2. The exponent o determines how rapidly the number of clusters 
decreases with increasing size s. At p-, the s-dependence of n, arises only from 
the prefactor n,(p-). In analogy to critical points, we assume that n,(p-) is 
a pure power law; in the case that € gives the only length scale, which is 
infinite at p.; then at p,. there can be no characteristic lengths, cluster sizes, 
etc. That is, ns(p-) can have only the form 


Ns(Pc)v 8S". (7.5.23) 


The complete function (7.5.22) is then of the form 


ns(p) =s "f ((v — pe)? 8) . (7.5.24) 


and it is a homogeneous function of s and (p—p-,). We can relate the exponent 
7 to already known exponents: the average cluster size is, from Eq. (7.5.6), 


S= . d, s’n5(p) « S- a ter 


CO Co 
x / ds s*-Te- 8 = ae | 2 Te*dz. 
1 c 


For 7 < 3, the integral exists, even when its lower limit goes to zero: it is 
then 


(7.5.25) 


T-3 


Sw cT73 = (p = De) 7 ; (7.5.26) 
from which, according to (7.5.3), it follows that 


3—T 
ae 


y= (7.5.27) 


Since for the Bethe lattice, y = 1 and o = 4, we find tT = 3. 

From (7.5.24) using the general relation (7.5.7) one can also determine 
Px. While the factor s? in (7.5.25) was sufficient to make the integral con- 
verge at its lower limit, this is not the case in (7.5.7). Therefore, we first write 
(7.5.7) in the form 


Py =1= 2 s(14(0) = nele)) = 2 sav.) 
; s : (7.5.28) 
= 52 (le) ~ malp)) +1 - < 
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has been used. Now the first term in (7.5.28) can be replaced by an integral 


P,. = const. gry z'-7 [1—e7*] dz+ eae 
ae ‘ (7.5.29) 
a 


a 


From this, we find for the exponent defined in Eq. (7.5.1) 


p=" a (7.5.30) 
For the Bethe lattice, one finds once again 3 = 1, in agreement with (7.5.18). 
In the Bethe lattice, the first term in (7.5.29)) also has the form p— p-, while 
in other lattices, the first term, (p — De)” , predominates relative to the second 
due to 6 <1. 

n (7.5.5), we also introduced the average number of clusters per lattice 
site, whose critical percolation behavior is characterized by an exponent a 
via 


N-= =e pple (7.5.31) 


That is, this quantity plays an analogous role to that of the free energy in 
thermal phase transitions. We note that in the case of percolation there are 
no interactions, and the free energy is determined merely by the entropy. 
Again inserting (7.5.24) for the cluster number into (7.5.31), we find 


-—1 
a ae (7.5.32) 
o 
which leads to a = —1 for the Bethe lattice. In summary, the critical expo- 
nents for the Bethe lattice are 
B=1,y=1,a=-1,v=1,7=5/2,0=1/2. (7.5.33) 


7.5.5 General Scaling Theory 


In the preceding section, the exponents for the Bethe lattice (Cayley tree) 
were calculated. In the process, we made some use of a scaling assumption 
(7.5.24). We will now generalize that assumption and derive the consequences 
which follow from it. 

We start with the general scaling hypothesis 


ns(p) = 8-7 f(Ip — pel * 5) , (7.5.34) 
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where + refers to p = p¢.°° The relations (7.5.27), (7.5.30), and (7.5.32), 
which contain only the exponents a, 3, y,0,7, also hold for the general scaling 
hypothesis. The scaling relation for the correlation length and other charac- 
teristics of the extension of the finite clusters must be derived once more. The 
correlation length is the root mean square distance between all the occupied 
sites within the same finite cluster. For a cluster with s occupied sites, the 
root mean square distance between all pairs is 


R= > bm). 


i=1 j=1 


The correlation length € is obtained by averaging over all clusters 
Ste ee We 

ec 87 Ns 
The quantity $8 ns is equal to the number of pairs in clusters n, of size s, 
i.e. proportional to the probability that a pair (in the same cluster) belongs 


to a cluster of the size s. 
The mean square cluster radius is given by 


= (7.5.35) 


— oo 2 
R2 = dani My SMe (7.5.36) 
oat Ss 


since sn, = the probability that an occupied site belongs to an s-cluster. 
The mean square distance increases with cluster size according to 


R,w s/4r , (7.5.37) 
where dy is the fractal dimension. Then it follows from (7.5.35) that 


eS Hp (Ip— alts) /Fellp—pdlF) 
s=1 


ne Ds 
~|p—pe| 47, 2<7<2.5 
1 T-1 
y= = 
dro do ° 


and from (7.5.36), 


>} ati- 4 -2 
R2~ So 8%" " fa(|p— pel” 8) ~ [p—pel ”** 


3° At the percolation threshold p = pe, the distribution of clusters is a power law 
ns(pe) = 8 * f+(0). The cutoff function f+(a) goes to zero for x = 1, for example 
as in (7.5.22) exponentially. The quantity 8maz = |p—po|~/7 refers to the largest 
cluster. Clusters of size s X S8maz are also distributed according to s-* for p £ De, 
and for s = 8maz, Ns(p) vanishes. 
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7.5.5.1 The Duality Transformation and the Percolation 
Threshold 


The computation of p, for bond percolation on a square lattice can be carried 
out by making use of a duality transformation. The definition of the dual 
lattice is illustrated in Fig. 7.26. The lattice points of the dual lattice are 
defined by the centers of the unit cells of the lattice. A bond in the dual 
lattice is placed wherever it does not cross a bond of the lattice; i.e. the 
probability for a bond in the dual lattice is 


Ga LS pe. 


In the dual lattice, there is likewise a bond percolation problem. For p < pe, 
there is no infinite cluster on the lattice, however there is an infinite cluster 
on the dual lattice. There is a path from one end of the dual lattice to the 
other which cuts no bonds on the lattice; thus q > p.. For p — pz from 
below, g — p? arrives at the percolation threshold from above, i.e. 


Pe=1l—pe. 


Thus, p- = 4. This result is exact for bond percolation. 


e lattice : Fig. 7.26. A lattice and its dual lat- 

© dual lattice tice. Left side: A lattice with bonds and 
— bonds in the lattice the dual lattice. Right side: Showing 
---- bonds in the dual lattice also the bonds in the dual lattice 
Remarks: 


(i) By means of similar considerations, one finds also that the percolation threshold 


for site percolation on a triangular lattice is given by pe. = 3. 
(ii) For the two-dimensional Ising model, also, the transition temperatures for a 
series of lattice structures were already known from duality transformations 


before its exact solution had been achieved. 


7.5.6 Real-Space Renormalization Group Theory 


We now discuss a real-space renormalization-group transformation, which 
allows the approximate determination of p. and the critical exponents. 

In the decimation transformation shown in Fig. 7.27 for a square lattice, 
every other lattice site is eliminated; this leads again to a square lattice. In 
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aes e e 
~~ 
ee = e e 


wWHHAwM eee 
p’ 4.p?(1-p) 2p (1-p)? 


Fig. 7.28. Bond configurations which lead to a bond (dashed) on the decimated 
lattice 


Fig. 7.27. A lattice and a dec- 
imated lattice 


the new lattice, a bond is placed between two remaining sites if at least one 
connection via two bonds existed on the original lattice (see Fig. 7.27). The 
bond configurations which lead to formation of a bond (shown as dashed lines) 
in the decimated lattice are indicated in Fig. 7.28. Below, the probability for 
these configurations is given. From the rules shown in Fig. 7.28, we find for 
the probability for the existence of a bond on the decimated lattice 


p' =p* + 4p3(1 — p) + 2p2(1 — p)? = 2p? — p*. (7.5.38) 


From this transformation law*!, one obtains the fixed-point equation p* = 
2p*? — p**. It has the solutions p* = 0, p* = 1, which correspond to the high- 
and low-temperature fixed points for phase transitions; and in addition, the 


+ 

two fixed points p* = Seen, of which only p* = v5=1 = 0.618... is 
acceptable. This value of the percolation threshold differs from the exact 
value found in the preceding section, $. The reasons for this are: (i) sites 
which were connected on the original lattice may not be connected on the 
decimated lattice; (ii) different bonds on the decimated lattice are no longer 
uncorrelated, since the existence of a bond on the original lattice can be 
responsible for the occurrence of several bonds on the decimated lattice. 

The linearization of the recursion relation around the fixed point yields 
vy = 0.817 for the exponent of the correlation length. 
The treatment of site percolation on a triangular lattice in two dimensions is most 
simple. The lattice points of a triangle are combined into a cell. This cell is counted 
as occupied if all three sites are occupied, or if two sites are occupied and one is 
empty, since in both cases there is a path through the cell. For all other configu- 
rations (only one site occupied or none occupied), the cell is unoccupied. For the 


3! A.P. Young and R. B. Stinchcombe, J. Phys. C: Solid State Phys. 8, L 535 (1975). 
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triangular lattice®?, one thus obtains as the recursion relation 

p' =p’ + 3p"(1—p), (7.5.39) 
with the fixed points p* = 0,1, $ This RG transformation thus yields p. = s for 
the percolation threshold, which is identical with the exact value (see remark (i) 
above). The linearization of the RG transformation around the fixed point yields 
the following result for the exponent v of the correlation length: 


—_ log V3 
7 log 3 


= 1.3547. 


This is nearer to the result obtained by series expansion, v = 1.34, as well as to 
the exact result, 4/3, than the result for the square lattice (see the remark on 
universality following Eq. (7.5.3)). 


7.5.6.1 Definition of the Fractal Dimension 


In a fractal object, the mass behaves as a function of the length L of a d-dimensional 
Euclidean section as 


M(L) ~ LS , 


and thus the density is 


pL) = 3 rs Ear cae 


An alternative definition of dy is obtained from the number of hypercubes N(Lm, 6) 
which one requires to cover the fractal structure. We take the side length of the 
hypercubes to be 6, and the hypercube which contains the whole cluster to have 
the side length Lm: 


MES ()" 


i.e. 
fata log N(Lm, 6) 
50 log 6 
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Problems for Chapter 7 


7.1 A generalized homogeneous function fulfills the relation 

fA a1, A? v2) = AY f (a1, £2) . 
Show that (a) the partial derivatives Ao pak f(a, 202) and (b) the Fourier trans- 
form g(ki,22) = f dta:e™*! f(x1, x2) of a generalized homogeneous function are 
likewise homogeneous functions. 


7.2 Derive the relations (7.3.13’) for A’, K’, L’, and M’. Include in the starting 
model in addition an interaction between the second-nearest neighbors L. Compute 
the recursion relation to leading order in K and L, i.e. up to K? and L. Show that 
(7.3.15a,b) results. 


7.3 What is the value of 6 for the two-dimensional decimation transformation from 
Sect. 7.3.3? 


7.4 Show, by Fourier transformation of the susceptibility y(q) = a X(9), that 
the correlation function assumes the form 


G(x) = eae Cllel/8)- 


7.5 Confirm Eq.(7.4.35). 

7.6 Show that 

wr— XO 
2 


is a solution of the Ginzburg-Landau equation (7.4.11). Calculate the free energy 
of the domain walls which it describes. 


m(a) = motanh 


7.7 Tricritical phase transition point. 
A tricritical phase transition point is described by the following Ginzburg-Landau 
functional: 


Fld] = / dn {o(V¢)? + ag? + vd® — ho} 
= = — , v>O0. 


Determine the uniform stationary solution ¢s¢ with the aid of the variational deriva- 
6F 
% 


. / 
witha=aT, T 


tive (S = 0) for h = 0 and the associated tricritical exponents az, 3:,7 and dz. 
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7.8 Consider the extended Ginzburg-Landau functional 


F(d] = | d’a{c(V9)? + ad* + us + v6" — ho}. 


(a) Determine the critical exponents a, 3, 7 and 6 for u > 0 in analogy to problem 
7.7. They take on the same values as in ‘the ¢* model (see Sect. 4.6); the term ~ ¢° 
is irrelevant, i.e. it yields only corrections to the scaling behavior of the ¢* model. 
Investigate the “crossover” of the tricritical behavior for h = 0 at small u. Consider 
the crossover function ™m(a), which is defined as follows: 


eq(u,T) = Ge(r)- mw) with $i(T) = Geq(u=0,7)~ 7%, = 


u 
J 3lalv | 
(b) Now investigate the case u < 0,h = 0. Here, a first-order phase transition 
occurs; at T., the absolute minimum of F changes from ¢=0to d=¢°. Calculate 
the shift of the transition temperature T. — To and the height of the jump in the 


order parameter ¢°. Critical exponents can also be defined for the approach to the 
tricritical point by variation of u 


1 
enw lu, T3o—To~ |ul® . 


Give expressions for @, and the “shift exponent” w. 
(c) Calculate the second-order phase transition lines for u < 0 and h # 0 by 
deriving a parameter representation from the conditions 

OF ag OF. 

0¢2 COB 
(d) Show that the free energy in the vicinity of the tricritical point obeys a gener- 
alized scaling law 


Flea] = FP F(a) 


by inserting the crossover function found in (a) into F (¢; is called the “crossover 
exponent” ). Show that the scaling relations 


6=14+2, a+264+7=2 


are obeyed in (a) and at the tricritical point (problem 7.7). 
(e) Discuss the hysteresis behavior for a first-order phase transition (u < 0). 


7.9 In the Ginzburg-Landau approximation, the spin-spin correlation function is 
given by 
! _ 1 ik(x—x’) 1 F zoe 
(m(x)m(x')) = 75 Doe Bele? +B) } CO Te) « 


|k| <A 


(a) Replace the sum by an integral. 
(b) show that in the limit is — oo, the following relation holds: 


(m( aca ae Qt 
(c) Show that for d = 3 and large €, 
Leck 


(m(x)m(x')) = 
holds. 


8rcB |x —x’| 
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7.10 Investigate the behavior of the following integral in the limit € — oo: 


ie d4q gtd 
o (27)4 (1+4q?)?’ 


by demonstrating that: 

(a) T«é*4 , d<4; 
(bl) ITxmé , d=4; 

(c) lx A— Bé*"4, d>4. 


7.11 The phase transition of a molecular zipper from C. Kittel, American Journal 
of Physics 37, 917, (1969). 

A greatly simplified model of the helix-coil transition in polypeptides or DNA, which 
describes the transition between hydrogen-bond stabilized helices and a molecular 
coil, is the “molecular zipper”. 

A molecular zipper consists of N bonds which can be broken from only one 
direction. It requires an energy ¢€ to break bond p + 1 if all the bonds 1, ... , p are 
broken, but an infinite energy if the preceding bonds are not all broken. A broken 
bond is taken to have G orientations, i.e. its state is G—fold degenerate. The zipper 
is open when all N — 1 bonds are broken. 


prt N-1 


(a) Determine the partition function 


Z= ; v= Gexp(—-e@). 


(b) Determine the average number (s) of broken bonds. Investigate (s) in the 
vicinity of z- = 1. Which value does (s) assume at x-, and what is the slope there? 
How does (s) behave at x >> l anda <1? 

(c) What would be the partition function if the zipper could be opened from both 
ends? 


7.12 Fluctuations in the Gaussian approximation below Ty. 
Expand the Ginzburg-Landau functional 


F[m] = [ae [am (2)? + pm(x)! +¢e(Vm(z))? — hm(c)| ; 


which is O(n)-symmetrical for h = 0, up to second order in terms of the fluctuations 
of the order parameter m/(x). Below Ty, 


m(z) =mye, + m (x) ; h= 2(a + bmi)mi 


holds. 
(a) Show that for h — 0, the long-wavelength (k — 0) transverse fluctuations 
mi, (i = 2,...,n) require no “excitation energy” (Goldstone modes), and determine 


the Gibbs free energy. In which cases do singularities occur? 
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(b) What is the expression for the specific heat cp29 below T- in the harmonic 
approximation? Compare it with the result for the disordered phase. 

(c) Calculate the longitudinal and transverse correlation functions relative to the 
spontaneous magnetization m1 


G\ (a — 2’) = (mi (x)mi(2’)) and 
Giij(e— 2’) = (mi(x)mj(a')), i,9 =2....,n 


for d = 3 from its Fourier transform in the harmonic approximation. Discuss in 
particular the limiting case h — 0. 


7.13 The longitudinal correlation function below Ty. 
The results from problem 7.12 lead us to expect that taking into account the trans- 
verse fluctuations just in a harmonic approximation will in general be insufficient. 
Anharmonic contributions can be incorporated if we fix the length of the vector 
m(a) (h = 0), as in the underlying Heisenberg model: 


mi(x)” + So mi(x)? = m3 = const. 


Compute the Fourier transform G'(k), by factorizing the four-spin correlation func- 
tion in a suitable manner into two-spin correlation functions 


n 


z >, (mi(x)*m;(2’)”) 


i,j=2 


1 


and inserting 

d*k eik(x—x’) 
G \ = 
Nee) i 2Bck? 


Remark: for n > 2 and 2 < d < 4, the relations G1 (k) « z and G'y « cor are 
fulfilled exactly in the limit k — 0. 


7.14 Verify the second line in Eq. (7.5.22) . 


7.15 The Hubbard-Stratonovich transformation: using the identity 
ve 1 = 
exp - ys 14818; = const. if (II dms) exp{ . meJigims ; 
i,j ae aj 


show that the partition function of the Ising Hamiltonian H = 7, ; JijSiS; can be 
written in the form 


Z= const. | (I] dm:) exp{H’ ({m}) } ‘ 


Give the expansion of H’ in terms of m; up to the order O(m*). Caveat: the Ising 
Hamiltonian must be extended by terms with J;; so that the matrix Jj; is positive 
definite. 
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7.16 Lattice-gas model. The partition function of a classical gas is to be mapped 
onto that of an Ising magnet. 

Method: the d-dimensional configuration space is divided up into N cells. In each 
cell, there is at most one atom (hard core volume). One can imagine a lattice in 
which a cell is represented by a lattice site which is either empty or occupied (n; = 0 
or 1). The attractive interaction U(x; — x;) between two atoms is to be taken into 
account in the energy by the term $U2(i, j)ninj. 

(a) The grand partition function for this problem, after integrating out the kinetic 
energy, is given by 


Zo = (11 ! exp[-8(— > n: + 5 Ualis)nins)| : 


Ne ef Qrh 
ji = kT log zvo = kT lo ( e Zz ; 
e ees ner da InmkT 


where vo is the volume of a cell. 
(b) By introducing spin variables 9; (ni = $(1+ Si), 9; = £1), bring the grand 
partition function into the form 


Gas (II S~ ) exw[-(8 ~ Das, - 2 4488) , 


i=1 S;=-1,1 


Calculate the relations between Eo,h, J and su, U2, vo. 

(c) Determine the vapor-pressure curve of the gas from the phase-boundary curve 
h = 0 of the ferromagnet. 

(d) Compute the particle-density correlation function for a lattice gas. 


7.17 Demonstrate Eq. (7.4.63) using scaling relations. 


7.18 Show that from (7.4.68) in the limit of small k and for h = 0, the longitudinal 
correlation function 
1 


Gi(k) & Fae 


follows. 


7.19 Shift of T. in the Ginzburg-Landau Theory. Start from Eq. (7.4.1) and use 
the so called quasi harmonic approximation in the paramagnetic phase. There the 
third (nonlinear) term in (7.4.1) is replaced by 6b < m(x)? > m(x). 

(a) Justify this approximation. 

(b) Compute the transition temperature T; and show that T. < ae 


7.20 Determine the fixed points of the transformation equation (7.5.38). 


8. Brownian Motion, Equations of Motion, and 
the Fokker—Planck Equations 


The chapters which follow deal with nonequilibrium processes. First, in chap- 
ter 8, we treat the topic of the Langevin equations and the related Fokker— 
Planck equations. In the next chapter, the Boltzmann equation is discussed; 
it is fundamental for dealing with the dynamics of dilute gases and also for 
transport phenemona in condensed matter. In the final chapter, we take up 
general problems of irreversibility and the transition to equilibrium. 


8.1 Langevin Equations 


8.1.1 The Free Langevin Equation 
8.1.1.1 Brownian Motion 


A variety of situations occur in Nature in which one is not interested in the 
complete dynamics of a many-body system, but instead only in a subset of 
particular variables. The remaining variables lead through their equations of 
motion to relatively rapidly varying stochastic forces and to damping effects. 
Examples are the Brownian motion of a massive particle in a liquid, the 
equations of motion of conserved densities, and the dynamics of the order 
parameter in the vicinity of a critical point. 

We begin by discussing the Brownian motion as a basic example of a 
stochastisic process. A heavy particle of mass m and velocity v is supposed 
to be moving in a liquid consisting of light particles. This “Brownian particle” 
is subject to random collisions with the molecules of the liquid (Fig. 8.1). The 
collisions with the molecules of the liquid give rise to an average frictional 
force on the massive particle, a stochastic force f(t), which fluctuates around 
its average value as shown in Fig. 8.2. The first contribution —m¢v to this 
force will be characterized by a coefficient of friction ¢. Under these physical 
conditions, the Newtonian equation of motion thus becomes the so called 
Langevin equation: 


mo = —m¢u + f(t). (8.1.1) 
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Fig. 8.1. The Brownian motion Fig. 8.2. Stochastic forces in Brown- 
ian motion 


Such equations are referred to as stochastic equations of motion and the 
processes they describe as stochastic processes. ! 

The correlation time 7. denotes the time during which the fluctuations 
of the stochastic force remain correlated?. From this, we assume that the 
average force and its autocorrelation function have the following form at 
differing times? 


(f(t)) =0 
(FOFE)) = ot -#) . 
Here, ¢(r) differs noticeably from zero only for tT < 7, (Fig. 8.3). Since we 
are interested in the motion of our Brownian particle over times of order t 


which are considerably longer than 7,., we can approximate ¢(r) by a delta 
function 


d(T) = Ad(r) . (8.1.3) 


(8.1.2) 


The coefficient A is a measure of the strength of the mean square deviation 
of the stochastic force. Since friction also increases proportionally to the 
strength of the collisions, there must be a connection between X and the 
coefficient of friction ¢. In order to find this connection, we first solve the 
Langevin equation (8.1.1). 


' Due to the stochastic force in Eq. (8.1.1), the velocity is also a stochastic quantity, 
i.e. a random variable. 
Under the precondition that the collisions of the liquid molecules with the Brow- 
nian particle are completely uncorrelated, the correlation time is roughly equal 

—6 

ie cmjace = 
10-1" sec, where a is the radius of the massive particle and @ the average velocity 
of the molecules of the medium. 
The mean value ( ) can be understood either as an average over independent 
Brownian particles or as an average over time for a single Brownian particle. In 
order to fix the higher moments of f(t), we will later assume that f(t) follows a 
Gaussian distribution, Eq. (8.1.26). 


2 


to the duration of a collision. For this time, we obtain T. & ¢ = 
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o(t) 


Fig. 8.3. The correlation of 
Te t the stochastic forces 


8.1.1.2 The Einstein Relation 


The equation of motion (8.1.1) can be solved with the help of the retarded 
Green’s function G(t), which is defined by 


G+¢G=<46(t), G@)=e@e“. (8.1.4) 


Letting vo be the initial value of the velocity, one obtains for v(t) 


st ajertis: | * dr G(t— 1) f(r)/m 


= ue Site S fo eS? f(r)/m. (8.1.5) 
0 


Since the dependence of f(T) is known only statistically, we do not consider 
the average value of v(t), but instead that of its square, (v(t)?) 


t t 
1 1 
(v(t)?) = eae ar f dr! S47) g(r — ie + ze 2st ; 


here, the cross term vanishes. With Eq. (8.1.3), we obtain 


(v(t)’) = aa (1 e786) t vge 76 aa A 


Tee (8.1.6) 


For t > ¢7!, the contribution of vg becomes negligible and the memory of 
the initial value is lost. Hence ¢~! plays the role of a relaxation time. 

We require that our particle attain thermal equilibrium after long times, 
t > ¢~1, ie. that the average value of the kinetic energy obey the equiparti- 
tion theorem 


=m(v(t)*) = Shr : (8.1.7) 
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From this, we find the so called Einstein relation 
A= 2¢mkT . (8.1.8) 


The coefficient of friction ¢ is proportional to the mean square deviation A 
of the stochastic force. 


8.1.1.3 The Velocity Correlation Function 


Next, we compute the velocity correlation function: 
i Pe iat 4 
(v(t)u(t!)) = e St) i dr | dr’ S47) — §(r—=7')+ugeS") | (8.1.9) 
0 0) m 


Since the roles of t and t’ are arbitrarily interchangeable, we can assume with- 
out loss of generality that t < t’ and immediately evaluate the two integrals 
in the order given in this equation, with the result (e7¢ min(t,t') _ 1) son , thus 
obtaining finally 


(v(t)v(t’)) —A_o-¢lt-#' + (+3 ees eee ; (8.1.10) 
For t,t’ > ¢~', the second term in (8.1.10) can be neglected. 


8.1.1.4 The Mean Square Deviation 


In order to obtain the mean square displacement for t >> ¢~!, we need only 
integrate (8.1.10) twice, 


t t Xr ) 
(oo?) = far f dr' ce (8.1.11) 


Intermediate calculation for integrals of the type 


pe fear [rar sr—7) 


We denote the parent function of f(r) by F(7) and evaluate the integral over 7, 
I= Soar’ (F(t — 7’) — F(—r’)). Now we substitute u = t— 7’ into the first term 
and obtain after integrating by parts 


p= i du (F(u) — F(—u)) = 4(F() — F(-#)) i duu(f(u) + f(—u)) 


0 


and from this the final result 


[oa fae te) = fawn + sw). (8.1.12) 
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With Eq. (8.1.12), it follows for (8.1.11) that 


mN : A 
ow) = sa? | du(t—u)eS* = ame 


or 
(x? (t)) = 2Dt (8.1.13) 


with the diffusion constant 


r kT 
=—= SS = 8.1.14 
2¢2m2 Cm ( ) 
It can be seen that D plays the role of a diffusion constant by starting from 
the equation of continuity for the particle density 


n(x) + Vj(x) =0 (8.1.15a) 
and the current density 

j(xz) = —DVn(z) . (8.1.15b) 
The resulting diffusion equation 

n(x) = DV? n(x) (8.1.16) 


has the one-dimensional solution 


Na 
= 4Dt 
n(x, t) TRE : (8.1.17) 
The particle number density n(x, t) from Eq. (8.1.17) describes the spreading 
out of N particles which were concentrated at x = 0 at the time t = 0 
(n(a,0) = Né(x)). That is, the mean square displacement increases with 
time as 2Dt. (More general solutions of (8.1.16) can be found from (8.1.17) 
by superposition.) 
We can cast the Einstein relation in a more familiar form by introducing 
the mobility ys into (8.1.1) in place of the coefficient of friction. The Langevin 
equation then reads 


1 
mé=—p e+ f with p= oe (8.1.18) 


and the Einstein relation takes on the form 
D=wkT. (8.1.19) 


The diffusion constant is thus proportional to the mobility of the particle and 
to the temperature. 
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Remarks: 


(i) In a simplified version of Einstein’s* historical derivation of (8.1.19), 
we treat (instead of the osmotic pressure in a force field) the dynamic origin 
of the barometric pressure formula. The essential consideration is that in a 
gravitational field there are two currents which must compensate each other 
in equilibrium. They are the diffusion current —D £n(z) and the current of 
particles falling in the gravitational field, jn(z). Here, n(z) is the particle 
number density and 0 is the mean velocity of falling, which, due to friction, 
is found from ~~! = —mg. Since the sum of these two currents must vanish, 
we find the condition 


Do n(2) —mgpun(z) =0. (8.1.20) 


From this, the barometric pressure formula n(z) x e~ *? is obtained if the 
Einstein relation (8.1.19) is fulfilled. 

(ii) In the Brownian motion of a sphere in a liquid with the viscosity 
constant 7, the frictional force is given by Stokes’ law, Fy, = 67anxz, where a 
is the radius and « the velocity of the sphere. Then the diffusion constant is 
D =kT/6zan and the mean square displacement of the sphere is given by 

kTt 


Kas ES aa (8.1.21) 


Using this relation, an observation of (x?(t)) allows the experimental deter- 
mination of the Boltzmann constant k. 


8.1.2 The Langevin Equation in a Force Field 


As a generalization of the preceding treatment, we now consider the Brownian 
motion in an external force field 


OV 
Then the Langevin equation is given by 
mé = —m¢é + F(x) + f(t), (8.1.22b) 


where we assume that the collisions and frictional effects of the molecules 
are not modified by the external force and therefore the stochastic force f(t) 
again obeys (8.1.2), (8.1.3), and (8.1.8).° 

An important special case of (8.1.22b) is the limiting case of strong damp- 
ing ¢. When the inequality m¢a >> md is fulfilled (as is the case e.g for 
periodic motion at low frequencies), it follows from (8.1.22b) that 


* See the reference at the end of this chapter. 
° We will later see that the Einstein relation (8.1.8) ensures that the function 


exp(—(2 +V(x))/kT) be an equilibrium distribution for this stochastic process. 
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#=-TZ-+r(0), (8.1.23) 


where the damping constant I’ and the fluctuating force r(t) are given by 


1 
I= —, and r(t) = — f(t). (8.1.24) 
me me 
The stochastic force r(t), according to Eqns. (8.1.2) and (8.1.3), obeys the 
relation 


t 
ae (8.1.25) 
(r(t)r(t’)) = 2VkT6(t —t’) . 
For the characterization of the higher moments (correlation functions) of 
r(t), we will further assume in the following that r(¢) follows a Gaussian 
distribution 


Plr(t)] = 07 Lig are (8.1.26) 


P|r(t)] gives the probability density for the values of r(t) in the interval 
[tot], where to and ty are the initial and final times. To define the functional 
integration, we subdivide the interval into 


tr — to 


N= A 


small subintervals of width A and introduce the discrete times 
t; =to +iA, 7=0,...,.N—-1. 


The element of the functional integration D[r] is defined by 


A-—-0 3 
— 


N-1 
Dlr] = lim (ute wi) : (8.1.27) 


The normalization of the probability density is 


2 
= AL (ti) 
[Pelle t)] = lim, TI [f (we aren) e LiAarer = 1. (8.1.28) 


As a check, we calculate 


AD kT 63; 
(r(ti)r(ts)) = So —by = 2reT 


which is in agreement with Eqns. (8.1.2), (8.1.3) and (8.1.8). 


OER SG —4,) 
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Since Langevin equations of the type (8.1.23) occur in a variety of different 
physical situations, we want to add some elementary explanations. We first 
consider (8.1.23) without the stochastic force, ie. ¢ = re. In regions of 
positive (negative) slope of V(x), x will be shifted in the negative (positive) 
x direction. The coordinate x moves in the direction of one of the minima 
of V(a) (see Fig. 8.4). At the extrema of V(x), « vanishes. The effect of 
the stochastic force r(t) is that the motion towards the minima becomes 
fluctuating, and even at its extreme positions the particle is not at rest, 
but instead is continually pushed away, so that the possibility exists of a 
transition from one minimum into another. The calculation of such transition 
rates is of interest for, among other applications, thermally activated hopping 
of impurities in solids and for chemical reactions (see Sect. 8.3.2). 


V (x) 


tion resulting from 
the equation of mo- 
tion ¢ = —'OV/da. 


\ Pe Fig. 8.4. The mo- 
\ 


8.2 The Derivation of the Fokker—Planck Equation from 
the Langevin Equation 


Next, we wish to derive equations of motion for the probability densities in 
the Langevin equations (8.1.1), (8.1.22b), and (8.1.23). 


8.2.1 The Fokker—Planck Equation for the Langevin Equation 
(8.1.1) 
We define 

P(é,t) = (8(E- v(t), (8.2.1) 


the probability density for the event that the Brownian particle has the ve- 
locity € at the time t. This means that P(€,t)d& is the probability that the 
velocity lies within the interval [€, € + dé]. 
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We now derive an equation of motion for P(€,t): 


0 


FPlEst) = -F(5(E - vO) ol) 
= aie v()) (Cult) + = 10)) 
- ele v(t)) (-ce + +409) 
1 0 
= sel(6PE 1) - TF (5(E- vO) FO) (8.2.2) 


where the Langevin equation (8.1.1) has been inserted in the second line. To 
compute the last term, we require the probability density for the stochastic 
force, assumed to follow a Gaussian distribution: 


PLf()] = en So emer (8.2.3) 


The averages (...) are given by the functional integral with the weight (8.2.3) 
(see Eq. (8.1.26)). In particular, for the last term in (8.2.2), we obtain 


(( — v(t) fF) = / DIF (t)] 6(€ ny f(t)e~ Sere 


= —2¢mkT [rire 5(€ — v(t) sin - {ee 
= amar f DIg(ey] Wm 5(6 — v(d) 


_ 6 7 6) du(t) 
= roma ( ep AlE - o()) = —2¢mkT Fe ((6 v(t)) sre) 
(8.2.4) 
Here, we have to use the solution (8.1.5) 
v(t) = vpe$* +/ dr G(t — ni (8.1.5) 
0 
and take the derivative with respect to f(t). With “ou = 6(7—t) and (8.1.4), 
we obtain 
u(t) if peal wal 
ray: dre ot T)= Sm’ (8.2.5) 


The factor $ results from the fact that the integration interval includes only 
half of the 6-function. Inserting (8.2.5) into (8.2.4) and (8.2.4) into (8.2.2), we 
obtain the equation of motion for the probability density, the Fokker—Planck 
equation: 

0 ) KPO? 


prema i) = Ca, UP t) + C Bp2t (vt) F (8.2.6) 
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Here, we have replaced the velocity € by v; it is not to be confused with the 
stochastic variable v(t). This relation can also be written in the form of an 
equation of continuity 


6) O kT O 
Remarks: 


(i) The current density, the expression in large parentheses, is composed of 
a drift term and a diffusion current. 

(ii) The current density vanishes if the probability density has the form 
P(v,t) « e-%r. The Macwell distribution is thus (at least one) equi- 
librium distribution. Here, the Einstein relation (8.1.8) plays a decisive 
role. Conversely, we could have obtained the Einstein relation by requir- 
ing that the Maxwell distribution be a solution of the Fokker—Planck 
equation. 

(iii) We shall see in Sect. 8.3.1 that P(v,t) becomes the Maxwell distribution 
in the course of time, and that the latter is therefore the only equilibrium 
distribution of the Fokker—Planck equation (8.2.6). 


8.2.2 Derivation of the Smoluchowski Equation for the 
Overdamped Langevin Equation, (8.1.23) 


For the stochastic equation of motion (8.1.23), 


t= ag OU +r(t), (8.1.23) 
Ox 
we can also define a probability density 
P(é,t) = (6(€ - 2(2))) , (8.2.8) 


where P(€,t)d& is the probability of finding the particle at time t at the 
position € in the interval d€. We now derive an equation of motion for P(€, t), 


performing the operation (F(x) = —aV) 


) 0 
BP Et) = ~ 5g (O(E — 2(6)) #00) 


a 
= au a(t)) (°K (a) + r(t))) 

a a 
= ge (PPE OKO) ~ a6 (O(E —x(t))r(t)) . (8.2.9) 


The overdamped Langevin equation was inserted in the second line. For the 
last term, we find in analogy to Eq. (8.2.4) 
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(5(€ — e(t))r(t)) = arer( =a o(é 7 x(t))) 


7 6) OLE) Ne 6) 
= rer z-(5(6 x(t)) er(d) ) = -TRTZ-P(G,t) « (8.2.10) 
Here, we have integrated (8.1.23) between 0 and ¢, 
x(t) = x(0) + [ dr (('K(x(r)) +r(7)) , (8.2.11) 
from which it follows that 
da(t) f° (OP F(a(r)) dx(r) be Ve 
S(t) =| ( er + d(t ) dr. (8.2.12) 
da(T) 


The derivative is Co ae 0 for 7 < t’ due to causality and is nonzero only for 
7 >t’, with a finite value at 7 = t’. We thus obtain 
dx(t) * AP F(x(r)) dx(r) 


— = : 2.1 
ir(t?) eae 7c ee for t'<t (8.2.13a) 


and 
da(t) — [OL F(a(r)) 62(r) ae co 
or(t’) =| Ox(t)  dr(t’) "7 9 for t=t. (8.2.13b) 
0 for t/=t 


This demonstrates the last step in (8.2.10). From (8.2.10) and (8.2.9), we 
obtain the equation of motion for P(€,t), the so called Smoluchowski equation 


0 0 0? 
—P(€é,t) =-=(TP(E,t)F VkT— P(E,t) . 8.2.14 
Remarks: 
(i) One can cast the Smoluchowski equation (8.2.14) in the form of an equa- 
tion of continuity 


O 0. 

at = ~ 9,44) ; (8.2.15a) 
with the current density 

j(a,t) = —I (arg - K(e)) P(a,t). (8.2.15b) 


The current density j(x,t) is composed of a diffusion term and a drift 
term, in that order. 
(ii) Clearly, 


P(a,t) eV @)/kP (8.2.16) 


is a stationary solution of the Smoluchowski equation. For this solution, 
j(a,t) is zero. 
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8.2.3 The Fokker—Planck Equation for the Langevin Equation 
(8.1.22b) 


For the general Langevin equation, (8.1.22b), we define the probability den- 
sity 


P(a,v,t) = (d(a — x(t))d(u — v(#))) . (8.2.17) 


Here, we must distinguish carefully between the quantities x and v and the 
stochastic variables x(t) and v(t). The meaning of the probability density 
P(a,v,t) can be characterized as follows: P(x, v,t)dxdv is the probability of 
finding the particle in the interval [7,2 + dz] with a velocity in [v,v + dv}. 
The equation of motion of P(z,v,t), the generalized Fokker—Planck equation 


a OP F(x) OP kT @P 
8) | 


Free 2 Ox m Ov2 


=¢ oop a (8.2.18) 


m Ov 


follows from a series of steps similar to those in Sect. 8.2.2; see problem 8.1. 


8.3 Examples and Applications 


In this section, the Fokker—Planck equation for free Brownian motion will 
be solved exactly. In addition, we will show in general for the Smoluchowski 
equation that the distribution function relaxes towards the equilibrium situa- 
tion. In this connection, a relation to supersymmetric quantum mechanics will 
also be pointed out. Furthermore, two important applications of the Langevin 
equations or the Fokker—Planck equations will be given: the transition rates 
in chemical reactions and the dynamics of critical phenomena. 


8.3.1 Integration of the Fokker—Planck Equation (8.2.6) 


We now want to solve the Fokker—Planck equation for the free Brownian 
motion, (8.2.6): 


P(v) =¢2 {Pe — ; (8.3.1) 


mu2 
We expect that P(v) will relax towards the Maxwell distribution, e~ 27 , 
following the relaxation law e~S*. This makes it reasonable to introduce the 
variable p = veS* in place of v. Then we have 


P(v,t) = P(pe‘*, t) = Y(p,t) , (8.3.2a) 
OP OV gs OP Oy. 
Ov Op’ dv? ~— dp? 
OP aY ap | oy _ oY . Ped 
dt Op dot. ot dp? Ot’ 


erst (8.3.2b) 


(8.3.2c) 
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Inserting (8.3.2a-c) into (8.2.6) or (8.3.1), we obtain 


OY kT O?PY 


This suggests the substitution Y = xe$*. Due to ane a OX Qt +C¢Y, it follows 


that 
2 
a Gs aoe (8.3.4) 


Now we introduce a new time variable by means of dd = e75‘dt 


1 


d= 5, (-1), (8.3.5) 
where W(t = 0) = 0. We then find from (8.3.4) the diffusion equation 
Ox kT 07x 
with its solution known from (8.1.17), 
1 (e-p0)? kT 
re! Se ee 8.3.7 
x(p,¥) nae ee lek (8.3.7) 


By returning to the original variables v and t, we find the following solu- 

tion 
2 m(v—vge7$*y? 
P(v,t) = yest = ¢§ —__™ __N nergy 8.3.8 
td) ='xe | SR Tenet y © oe) 

of the Fokker—Planck equation (8.2.6), which describes Brownian motion in 
the absence of external forces. The solution of the Smoluchowski equation 
(8.2.14) for a harmonic potential is also contained in (8.3.8). 

We now discuss the most important properties and consequences of the 
solution (8.3.8): 

In the limiting case t — 0, we have 


him P(v,t) = d(v — v9) . (8.3.9a) 


In the limit of long times, t — oo, the result is 


1 
: _ omy? /2kT m 2 
lim P(v,t) =¢ (<=) (8.3.9b) 


Remark: Since P(v,t) has the property (8.3.9a), we also have found the conditional 
probability density in (8.3.8)° 


® The conditional probability P(v,t|vo, to) gives the probability that at time t the 
value v occurs, under the condition that it was vo at the time to. 
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P(v, t|vo, to) = P(v,t — to) . (8.3.10) 


This is not surprising. Since, as a result of (8.1.1), (8.1.2) and (8.1.3), a Markov pro- 
cess” is specified, P(v, t|vo, to) likewise obeys the Fokker—Planck equation (8.2.6). 


For an arbitrary integrable and normalized initial probability density 
p(vp) at time to 


favoolvn) =1 (8.3.11) 


we find with (8.3.8) the time dependence 


1) = | dvoP( v,t —to)p(vo) . (8.3.12) 
Clearly, p(v,t) fulfills the initial condition 
lim p(v, t) = p(vo) , (8.3.13a) 


while for long times 


jim p(v,t) =e” 27 ( = y favo) =e 2T ( 
— Co 


i 
m \3 
QrkT Ee <oetg?) 
the Maxwell distribution is obtained. Therefore, for the Fokker—Planck equa- 
tion (8.2.6), and for the Smoluchowski equation with an harmonic potential, 
(8.2.14), we have proved that an arbitrary initial distribution relaxes towards 
the Maxwell distribution, (8.3.13b). 

The function (8.3.8) is also used, by the way, in Wilson’s exact renor- 

malization group transformation for the continuous partial elimination of 
short-wavelength critical fluctuations.® 


8.3.2 Chemical Reactions 


We now wish to calculate the thermally activated transition over a barrier 
(Fig. 8.5). An obvious physical application is the motion of an impurity atom 
in a solid from one local minimum of the lattice potential into another. Cer- 
tain chemical reactions can also be described on this basis. Here, x refers to 
the reaction coordinate, which characterizes the state of the molecule. The 
vicinity of the point A can, for example, refer to an excited state of a molecule, 
while B signifies the dissociated molecule. The transition from A to B takes 
place via configurations which have higher energies and is made possible by 
the thermal energy supplied by the surrounding medium. We formulate the 
following calculation in the language of chemical reactions. 


” A Markov process denotes a stochastic process in which all the conditional prob- 
abilities depend only on the last time which occurs in the conditions; e.g. 


P(ts, vs|t2, v2; t1, 01) = P(ts, vs|te, v2) , 


where t; < tg < ts. 
8 K.G. Wilson and J. Kogut, Phys. Rep. 12C, 75 (1974). 
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Fig. 8.5. A thermally activated tran- 
X sition over a barrier from the mini- 
mum A into the minimum B 


Xg XB 


We require the reaction rate (also called the transition rate), i.e. the tran- 
sition probability per unit time for the conversion of type A into type B. 
We assume that friction is so strong that we can employ the Smoluchowski 
equation (8.2.15a,b), 


P=-—j(a). (8.2.15a) 


Integration of this equation between the points a and ( yields 
d [* 
al dP 25 aya (any (8.3.14) 


where xg lies between the points A and B. It then follows that j(xg) is the 
transition rate between the states (the chemical species) A and B. 

To calculate j(ag), we assume that the barrier is sufficiently high so that 
the transition rate is small. Then in fact all the molecules will be in the region 
of the minimum A and will occupy states there according to the thermal 
distribution. The few molecules which have reached state B can be imagined 
to be filtered out. The strategy of our calculation is to find a stationary 
solution P(a) which has the properties 


1 
P(x) = aoe kT in the vicinity of A (8.3.15a) 
P(x) =0 in the vicinity of B. (8.3.15b) 


From the requirement of stationarity, it follows that 


6) QO OV 
=f. kT P 3.1 
=( a+t) j ee T0) 
from which we find by integrating once 
QO OV ' 
rT (rz. + x) P=-—jo. (8.3.17) 


The integration constant jo plays the role of the current density which, owing 
to the fact that (8.2.14) is source-free between A and B, is independent of x. 
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This integration constant can be determined from the boundary conditions 
given above. We make use of the following Ansatz for P(x): 


P(x) =e Y/*T p (8.3.18) 
in equation (8.3.17) 


9 p___Jo_v@/er (8.3.17’a) 


Integrating this equation from A to x, we obtain 


P(a) = const. — or i: dx eV @)/kT (8.3.17’b) 
The boundary condition at A, that there P follows the thermal equilibrium 
distribution, requires that 

1 


const. = [,deeVPT a (8.3.19a) 


Here, [ 4 means that the integral is evaluated in the vicinity of A. If the 
barrier is sufficiently high, contributions from regions more distant from the 
minimum are negligible?. The boundary condition at B requires 


- B 
0 =e Va/kT (coms. ae toate) ; (8.3.19b) 
so that 
kT d —V(a)/kT —i 
(ee (fadee a (8.3.20) 


Side eV (#)/kT 


For V(x) in the vicinity of A, we set V4(x) = L(2nv)*2?, and, without loss 
of generality, take the zero point of the energy scale at the point A. We then 
find 


[ dne-vane = i. dar e7 2 (27Y)? 2? /kT ee kT ; 
A —oo V2TV 


Here, the integration was extended beyond the neighborhood of A out to 
[—o0, co], which is permissible owing to the rapid decrease of the integrand. 
The main contribution to the integral in the denominator of (8.3.20) comes 


° Inserting (8.3.17'b) with (8.3.20) into (8.3.18), one obtains from the first term in 
the vicinity of point A just the equilibrium distribution, while the second term 
is negligible due to [idx a | Sida eV/ET 1, 
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from the vicinity of the barrier, where we set V(x) ~ A— (2mv')?x?/2. Here, 
A is the height of the barrier and v'” characterizes the barrier’s curvature 


ab 


B oo 12-2 J 
_ Qnv')P« kT 
| da eV me gA/AT f dxe7 T= ekT —__., 
A —oo V2’ 


This yields all together for the current density or the transition rate!® 
jo = 2avv'Te~A/*F ” (8.3.21) 


We point out some important aspects of the thermally activated transition 
rate: the decisive factor in this result is the Arrhenius dependence e~4/*? , 
where A denotes the barrier height, i.e. the activation energy. We can rewrite 


the prefactor by making the replacements (27v)” = mw”, (2rv’/)” = mw!” and 
r= 2 (Eq. (8.1.24): 
Wiel A/T (8.3.21') 


Jor 2n¢ 


If we assume that w’ + w, then the prefactor is proportional to the square of 
the vibration frequency characterized by the potential well." 


8.3.3 Critical Dynamics 


We have already pointed out in the introduction to Brownian motion that 
the theory developed to describe it has a considerably wider significance. 
Instead of the motion of a massive particle in a fluid of stochastically colliding 
molecules, one can consider quite different situations in which a small number 
of relatively slowly varying collective variables are interacting with many 
strongly varying, rapid degrees of freedom. The latter lead to a damping of 
the collective degrees of freedom. 

This situation occurs in the hydrodynamic region. Here, the collective 
degrees of freedom represent the densities of the conserved quantities. The 
typical time scales for these hydrodynamic degrees of freedom increase with 
decreasing q proportionally to 1/q or 1/q?, where q is the wavenumber. In 
comparison, in the range of small wavenumbers all the remaining degrees of 
freedom are very rapid and can be regarded as stochastic noise in the equa- 
tions of motion of the conserved densities. This then leads to the typical form 
of the hydrodynamic equations with damping terms proportional to q? or, 
in real space, ~ V?. We emphasize that “hydrodynamics” is by no means 
limited to the domain of liquids or gases, but instead, in an extension of its 


10 H. A. Kramers, Physica 7, 284 (1940) 

" w is the frequency (attempt frequency) with which the particle arrives at the 
right side of the potential well, from where it has the possibility (with however 
a small probability ~ e~4/ kT’) of overcoming the barrier. 
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original meaning, it includes the general dynamics of conserved quantities de- 
pending on the particular physical situation (dielectrics, ferromagnets, liquid 
crystals, etc.). 

A further important field in which this type of separation of time scales 
occurs is the dynamics in the neighborhood of critical points. As we know 
from the sections on static critical phenomena, the correlations of the local 
order parameter become long-ranged. There is thus a fluctuating order within 
regions whose size is of the order of the correlation length. As these correlated 
regions grow, the characteristic time scale also increases. Therefore, the re- 
maining degrees of freedom of the system can be regarded as rapidly varying. 
In ferromagnets, the order parameter is the magnetization. In its motions, the 
other degrees of freedom such as those of the electrons and lattice vibrations 
act as rapidly varying stochastic forces. 

In ferromagnets, the magnetic susceptibility behaves in the vicinity of the 
Curie point as 


: (8.3.22a) 
X~ AO T. 3.22a 
and the correlation function of the magnetization as 
e- |x| /€ 
x 


In the neighborhood of the critical point of the liquid-gas transition, the 
isothermal compressibility diverges as 


: (8.3.22c) 
Kr ~ 3. 
Cee - 
and the density-density correlation function has the dependence 
e7lxl/é 


In Eqns. (8.3.22 b,d), € denotes the correlation length, which behaves as 
1 
€~(f-—T,) ? in the molecular field approximation, cf. Sects. 5.4 and 6.5. 
A general model-independent approach to the theory of critical phenom- 
ena begins with a continuum description of the free energy, the Ginzburg— 
Landau expansion (see Sect. 7.4.1): 


F(M] = [ate {Sr — T.)M? + aM os 5(VM)? oe un} y (8:3:23) 


where e~*/*T denotes the statistical weight of a configuration M(x). The 
most probable configuration is given by 


OF 


7 io al (T —T.)M — cV?M + bM? —h. (8.3.24) 
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It follows from this that the magnetization and the susceptibility in the limit 
h— 0 are 
1 


SE ape = ey 
M~(T.-T)‘“O(T.-T) and x TT 


Since the correlation length diverges on approaching the critical point, £ — 
co, the fluctuations also become slow. This suggests the following stochastic 
equation of motion for the magnetization!” 

M(x,t) = SET + r(x, t). (8.3.25) 
The first term in the equation of motion causes relaxation towards the 
minimum of the free-energy functional. This thermodynamic force becomes 
stronger as the gradient )F/d)M(x) increases. The coefficient \ character- 
izes the relaxation rate analogously to I’ in the Smoluchowski equation. Fi- 
nally, r(x,t) is a stochastic force which is caused by the remaining degrees 
of freedom. Instead of a finite number of stochastic variables, we have here 
stochastic variables (x,t) and r(x,t) which depend on a continuous index, 
the position x. 

Instead of M(x), we can also introduce its Fourier transform 


My = sf déz e—™** M(x) (8.3.26) 


and likewise for r(x,t). Then the equation of motion (8.3.25) becomes 


OF 


My = —A 
. aM_.. 


+rx(t) . (8.3.25’) 


Finally, we still have to specify the properties of the stochastic forces. Their 
average value is zero 


(r(x, t)) = (r«(t)) = 0 


and furthermore they are correlated spatially and temporally only over short 
distances, which we can represent in idealized form by 


(re (t) ree (t’)) — Q20KT Oy, 4 O(t 4 t') (8.3.27) 
(r(x, t)r(x’, t’)) = 2AkT6(x — x’)d(t— 1’) . (8.3.27') 


For the mean square deviations of the force, we have postulated the Ein- 
stein relation, which guarantees that an equilibrium distribution is given by 


12 Also called the TDGL = time-dependent Ginzburg-Landau model. 
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e— 971M], We also assume that the probability density for the stochastic forces 
r(x,t) is a Gaussian distribution (cf. (8.1.26)). This has the result that the 
odd correlation functions for r(x,t) vanish and the even ones factor into 
products of (8.3.27’) (sum over all the pairwise contractions). We will now 
investigate the equation of motion (8.3.25’) for T > T,. In what follows, we 
use the Gaussian approximation, i.e. we neglect the anharmonic terms; then 
the equation of motion simplifies to 


My = —A(a'(T — T.) + ck”)My t+ re - (8.3.28) 


Its solution is already familiar from the elementary theory of Brownian mo- 
tion: 


t 
M,(t) = e~ My (0) + e7 | dt! ry, (t/)e™" , (8.3.29) 
0 


as is the resulting correlation function 


/ AkT , 
OO Mia) ee Oi Meno ete) (8.3.30) 


or, for times t,t’ > 7,1, 


kT 


— 7, |t—t’ 


(Mx(t) Mu (t’)) = dk,-1 


Here, we have introduced the relaxation rate 

ve = A(al(T — T,) + ck?) . (8.3.32a) 
In particular, for k = 0 we find 

yo~ (T-Te) ~E?. (8.3.32b) 


As we suspected at the beginning, the relaxation rate decreases drastically on 
approaching the critical point. One denotes this situation as “critical slowing 
down”. 

As we already know from Chap. 7, the interaction bM* between the crit- 
ical fluctuations leads to a modification of the critical exponents, e.g. € — 
(T — T.)~”. Likewise, in the framework of dynamic renormalization group 
theory it is seen that these interactions lead in the dynamics to 


Yo > &-* (8.3.33) 


with a dynamic critical exponent z +3 which differs from 2. 


13 See e.g. F. Schwabl and U.C. Tauber, Encyclopedia of Applied Physics, Vol. 13, 
343 (1995), VCH. 
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Remark: 


According to Eq. (8.3.25’), the dynamics of the order parameter are relax- 
ational. For isotropic ferromagnets, the magnetization is conserved and the 
coupled precessional motion of the magnetic moments leads to spin waves. 
In this case, the equations of motion are given by4 


M(x, t) = —AM(x,t) x oF (x,1) | ry (x, +r(x,t), (8.3.34) 
with 

(r(x,t)) =0, (8.3.35) 

(s(x, t)r, (x, t)) = —2PkTV75O (x — x’) 5(t — t') 5,5 , (8.3.36) 


which leads to spin diffusion above the Curie temperature and to spin waves 
below it (cf. problem 8.9). The first term on the right-hand side of the equa- 
tion of motion produces the precessional motion of the local magnetization 
M(x, t) around the local field 6F /5M(x,t) at the point x. The second term 
gives rise to the damping. Since the magnetization is conserved, it is taken 
to be proportional to V7?, i.e. in Fourier space it is proportional to k?. These 
equations of motion are known as the Bloch equations or Landau-—Lifshitz 
equations and, without the stochastic term, have been applied in solid-state 
physics since long before the advent of interest in critical dynamic phenom- 
ena. The stochastic force r(x,t) is due to the remaining, rapidly fluctuating 
degrees of freedom. The functional of the free energy is 


1 b 
F[M(x, t)] = 5 [ave la'(r — T,)M?(x, t) + 5M'(x,t) 
+ c(VM(x, t))? —hM(x,t)| . (8.3.37) 
*8.3.4 The Smoluchowski Equation and Supersymmetric Quantum 
Mechanics 
8.3.4.1 The Eigenvalue Equation 


In order to bring the Smoluchowski equation (8.2.14) (V’ = OV/0x) = -F 


OP. 8 a 
a = l be («rz v’) P (8.3.38) 


into a form which contains only the second derivative with respect to x, we 
apply the Ansatz 


‘4S. Ma and G.F. Mazenko, Phys. Rev. B11, 4077 (1975). 
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P(a,t) = e~V )/24T oa, t) , (8.3.39) 
obtaining 
2 " 2 
2p =kTT (= _ cre] p. (8.3.40) 
This is a Schrédinger equation with an imaginary time 
2 
ih ESET = ( — v°(2)) p. (8.3.41) 
with the potential 
V°(a) = : ve = (8.3.42) 
2 )4(kT)? 9 2kT 
Following the separation of the variables 
plage OEP a)s (8.3.43) 


we obtain from Eq. (8.3.40) the eigenvalue equation 
1 
en = (—En + V(x) en(2) - (8.3.44) 


Formally, equation (8.3.44) is identical with a time-independent Schrodinger 
equation.’> In (8.3.43) and (8.3.44), we have numbered the eigenfunctions 
and eigenvalues which follow from (8.3.44) with the index n. 

The ground state of (8.3.44) is given by 


yo =Ne"#T, Ey =0, (8.3.45) 


where NV is a normalization factor. Inserting in (8.3.39), we find for P(z,t) 
the equilibrium distribution 


P(x, t) =Ne~V@)/kF (8.3.45’) 


From (8.3.42), we can immediately see the connection with supersymmetric 
quantum mechanics. The supersymmetric partner!® to V° has the potential 


1 
2 


ve 4 
A(kT)? | 2kT 


vi= (8.3.46) 


' N.G. van Kampen, J. Stat. Phys. 17, 71 (1977). 

'6 M. Bernstein and L.S. Brown, Phys. Rev. Lett. 52, 1933 (1984); F. Schwabl, 
QM I, Chap. 19, Springer 2005. The quantity ® introduced there is connected to 
the ground state wavefunctions yo and the potential V as follows: 6 = —yo/¢o = 
V!/2kT. 
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H°® H} 
Fig. 8.6. The excitation spec- 
tra of the two Hamiltonians H° 
and H’*, from QM I, pp. 353 
Bose sector Fermi sector and 361 


The excitation spectra of the two Hamiltonians 


Hol =" 4 yl) (8.3.47) 
are related in the manner shown in Fig. 8.6. One can advantageously make 
use of this connection if the problem with H! is simpler to solve than that 


with H°. 


8.3.4.2 Relaxation towards Equilibrium 


We can now solve the initial value problem for the Smoluchowski equation in 
general. Starting with an arbitrarily normalized initial distribution P(x), we 
can calculate p(x) and expand in the eigenfunctions of (8.3.44) 


p(z) = eV PET Pe 2) CnPn(2) ; (8.3.48) 
with the expansion coefficients 
Cn = fare) jeV P)/28F P(g) (8.3.49) 


From (8.3.43), we find the time dependence 


jaye yeaa) (8.3.50) 


from which, with (8.3.39), 


Pape. EEE SS Glens aa) (8.3.51) 
n=0 
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follows. The normalized ground state has the form 


e7V (®)/2kT ( 
(da e-V)/7) ae 
Therefore, the expansion coefficient co is given by 
dz P 1 
co = ic pie’ (#)/2kT pry) — Jax P(x) = 7a ° 
(fda e-V@)/T) / (fda e~V(@)/kT) 
(8.3.53) 
This allows us to cast (8.3.51) in the form 
ees ne kTTE 

P(a,t) = per EEE SN © cae PEE MOA @)y. (83.54) 


n=1 


With this, the initial-value problem for the Smoluchowski equation is solved 
in general. Since E,, > 0 for n > 1, it follows from this expansion that 


eV (a)/kT 


Jim. P(z,t) = fdxe-V@/kT : (8.3.55) 


which means that, starting from an arbitrary initial distribution, P(«,t) de- 
velops at long times towards the equilibrium distribution (8.3.45) or (8.3.55). 
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Problems for Chapter 8 


8.1 Derive the generalized Fokker—Planck equation, (8.2.18). 


8.2 A particle is moving with the step length / along the z-axis. Within each time 
step it hops to the right with the probability p+ and to the left with the probability 
p— (p+ +p— = 1). How far is it from the starting point on the average after t time 
steps if py = p— = 1/2, or if p, = 3/4 and p_ = 1/4? 
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8.3 Diffusion and Heat Conductivity 
(a) Solve the diffusion equation 


n= DAn 
for d= 1, 2 and 3 dimensions with the initial condition 
n(x, t = 0) = N67(x) . 


Here, n is the particle density, N the particle number, and D is the diffusion 
constant. 
(b) Another form of the diffusion equation is the heat conduction equation 


where T is the temperature, « the coefficient of thermal conductivity, c the specific 
heat, and p the density. 

Solve the following problem as an application: potatoes are stored at +5°C in 
a broad trench which is covered with a loose layer of earth of thickness d. Right 
after they are covered, a cold period suddenly begins, with a steady temperature 
of —10°C, and it lasts for two months. How thick does the earth layer have to be 
so that the potatoes will have cooled just to 0°C at the end of the two months? 
Assume as an approximation that the same values hold for the earth and for the 


potatoes: « = 0.4, c= 2000 ae p = 1000 <e 


8.4 Consider the Langevin equation of an overdamped harmonic oscillator 


a(t) = —Pa(t) + h(t) + r(t), 


where A(t) is an external force and r(t) a stochastic force with the properties 
(8.1.25). Compute the correlation function 


Cit, t') = (2(Hx(t')),,<0 > 


the response function 


x(t,t’) = 
and the Fourier transform of the response function. 


8.5 Damped Oscillator 
(a) Consider the damped harmonic oscillator 


mi + mle + muwgx = f(t) 


with the stochastic force f(t) from Eq. (8.1.25). Calculate the correlation function 
and the dynamic susceptibility. Discuss in particular the position of the poles and 
the line shape. What changes relative to the limiting cases of the non-damped 
oscillator or the overdamped oscillator? 

(b) Express the stationary solution (x(t)) under the action of a periodic external 
force fe(t) = fo cos at in terms of the dynamic susceptibility. Use it to compute 


the power dissipated, + fo at fe(t)(a(t)). 
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8.6 Diverse physical systems can be described as a subsystem capable of oscilla- 
tions that is coupled to a relaxing degree of freedom, whereby both systems are in 
contact with a heat bath (e.g. the propagation of sound waves in a medium in which 
chemical reactions are taking place, or the dynamics of phonons taking energy/heat 
diffusion into account). As a simple model, consider the following system of coupled 
equations: 


L£=—p 
m 


p = —mugr — Ip + by + R(t) 


jee +r(t) 
y= mie : 


Here, x and p describe the vibrational degrees of freedom (with the eigenfrequency 
wo), and y is the relaxational degree of freedom. The subsystems are mutually 
linearly coupled with their coupling strength determined by the parameter b. The 
coupling to the heat bath is accomplished by the stochastic forces R and r for 
each subsystem, with the usual properties (vanishing of the average values and the 
Einstein relations), and the associated damping coefficients I’ and y. 

(a) Calculate the dynamic susceptibility y.(w) for the vibrational degree of free- 
dom. 

(b) Discuss the expression obtained in the limiting case of 7 — 0, i.e. when the 
relaxation time of the relaxing system is very long. 


8.7 An example of an application of the overdamped Langevin equation is an 
electrical circuit consisting of a capacitor of capacity C' and a resistor of resistance 
R which is at the temperature T’. The voltage drop Ur over the resistor depends 
on the current J via Ur = RI, and the voltage Uc over the capacitor is related to 
the capacitor charge Q via Uc = g. On the average, the sum of the two voltages is 
zero, Ur + Uc = 0. In fact, the current results from the motion of many electrons, 
and collisions with the lattice ions and with phonons cause fluctuations which are 


modeled by a noise term Vj, in the voltage balance (J = Q) 


cm a 
RQ+=Q=V 
Q+ Ge th 

or 
: 1 1 
Ust pGue = RGM - 


(a) Assume the Einstein relation for the stochastic force and calculate the spectral 
distribution of the voltage fluctuations 


o(w) = ie dt e“* (U-(t)U-(0)) . 
(b) Compute 
(U2) = (Ue(t)Ue(t)) = i. adie) 


—oo 


and interpret the result, 4C(U2) = kT. 
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8.8 In a generalization of problem 8.7, let the circuit now contain also a coil or 
inductor of self-inductance L with a voltage drop U, = LI. The equation of motion 
for the charge on the capacitor is 


O+RQ+2Q=Var. 


By again assuming the Einstein relation for the noise voltage Vin, calculate the 
spectral distribution for the current f° dte'’(I(t)I(0)). 


8.9 Starting from the equations of motion for an isotropic ferromagnet (Eq. 8.3.34), 
investigate the ferromagnetic phase, in which 


M(x, t) = é.Mo + M(x, t) 


holds. 

(a) Linearize the equations of motion in 6M(x, t), and determine the transverse 
and longitudinal excitations relative to the z-direction. 

(b) Calculate the dynamic susceptibility 


4 = OM; (x t) 
i(k _ & dt i(kx—wt) ’ 
vith) = fdtedte dh;(0, 0) 


and the correlation function 
Getta) = / dx dt o*— (5. M(x, t)5M;(0,0)) . 


8.10 Solve the Smoluchowski equation 
OP(a,t) _ r 0 («r O a) P(:,t) 


ot Ox Ox _ Ox 


mw? 2 


5 @", by 


for an harmonic potential and an inverted harmonic potential V(x) = + 
solving the corresponding eigenvalue problem. 


8.11 Justify the Ansatz of Eq. (8.3.39) and carry out the rearrangement to give 
Eq. (8.3.40). 


8.12 Solve the Smoluchowski equation for the model potential 
V (a) = 2kT log(cosh x) 


using supersymmetric quantum mechanics, by transforming as in Chapter 8.3.4 
to a Schrédinger equation. (Literature: F. Schwabl, Quantum Mechanics, 3° ed., 
Chap. 19 (Springer Verlag, Heidelberg, New York, corrected printing 2005.) 


8.13 Stock-market prices as a stochastic process. 
Assume that the logarithm I(t) = log S(t) of the price S(t) of a stock obeys the 
Langevin equation (on a sufficiently rough time scale) 


d 
Si) =r +r) 


where r is a constant and I is a Gaussian “random force” with ((é)I(t’)) = 
2 U 
oa 6(t—t’). 
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(a) Explain this approach. Hints: What does the assumption that prices in the 
future cannot be predicted from the price trends in the past imply? Think first of 
a process which is discrete in time (e.g. the time dependence of the daily closing 
rates). Should the transition probability more correctly be a function of the price 
difference or of the price ratio? 

(b) Write the Fokker—Planck equation for /, and based on it, the equation for S. 
(c) What is the expectation value for the market price at the time t, when the 
stock is being traded at the price So at time to = 0? Hint: Solve the Fokker—Planck 
equation for | = log S. 


9. The Boltzmann Equation 


9.1 Introduction 


In the Langevin equation (Chap. 8), irreversibility was introduced phe- 
nomenologically through a damping term. Kinetic theories have the goal of 
explaining and quantitatively calculating transport processes and dissipative 
effects due to scattering of the atoms (or in a solid, of the quasiparticles). 
The object of these theories is the single-particle distribution function, whose 
time development is determined by the kinetic equation. 

In this chapter, we will deal with a monatomic classical gas consisting 
of particles of mass m; we thus presume that the thermal wavelength A7 = 
2rh/V2rmkT and the volume per particle v = n~! obey the inequality 


AT < nt oy 


i.e. the wavepackets are so strongly localized that the atoms can be treated 
classically. 

Further characteristic quantities which enter include the duration of a 
collision Tt. and the collision time + (this is the mean time between two 
collisions of an atom; see (9.2.12)). We have 7 & r-/tv and t & 1/nr2%, 
where r, is the range of the potentials and U is the average velocity of the 
particles. In order to be able to consider independent two-particle collisions, 
we need the additional condition 


Te KT, 


i.e. the duration of a collision is short in comparison to the collision time. 
This condition is fulfilled in the low-density limit, r, < n~!/3. Then collisions 
of more than two particles can be neglected. 

The kinetic equation which describes the case of a dilute gas considered 
here is the Boltzmann equation’. The Boltzmann equation is one of the most 
fundamental equations of non-equilibrium statistical mechanics and is applied 
in areas far beyond the case of the dilute gas?. 


' Ludwig Boltzmann, Wien. Ber. 66, 275 (1872); Vorlesungen tiber Gastheorie, 
Leipzig, 1896; Lectures on Gas Theory, translated by S. Brush, University of 
California Press, Berkeley, 1964 

? See e.g. J. M. Ziman, Principles of the Theory of solids, 2°* ed, Cambridge 
Univ. Press, Cambridge, 1972. 
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In this chapter we will introduce the Boltzmann equation using the clas- 
sical derivation of Boltzmann'. Next, we discuss some fundamental questions 
relating to irreversibility based on the H theorem. As an application of the 
Boltzmann equation we then determine the hydrodynamic equations and 
their eigenmodes (sound, heat diffusion). The transport coefficients are de- 
rived systematically from the linearized Boltzmann equation using its eigen- 
modes and eigenfrequencies. 


9.2 Derivation of the Boltzmann Equation 


We presume that only one species of atoms is present. For these atoms, we 
seek the equation of motion of the single-particle distribution function. 


Definition: The single-particle distribution function f(x, v,t) is defined by 
f(x, v, t) da d?v = the number of particles which are found at time t in 
the volume element d?2 around the point x and d?v around the velocity 
v. 


pee d°v f(x,v,t)=N. (9.2.1) 


The single-particle distribution function f(x, v, t) is related to the N-particle 
distribution function p(x1, v1,...,XN,Vw,t) (Eq. (2.3.1)) through 
f(x1,v1,¢) =N f dae d?v2 ... f Barn dun p(X1,V1,---,XN, VN, ft). 


Remarks: 


1. In the kinetic theory, one usually takes the velocity as variable instead 
of the momentum, v = p/m. 

2. The 6-dimensional space generated by x and v is called ju space. 

3. The volume elements d?z and dv are supposed to to be of small linear 
dimensions compared to the macroscopic scale or to the mean velocity 
0 = /~kT/m, but large compared to the microscopic scale, so that many 
particles are to be found within each element. In a gas under standard 
conditions (T = 1°C, P = latm), the number of molecules per cm? is 
n = 3x 10%. In a cube of edge length 10-3 cm, i.e. a volume element 
of the size d?x = 10~°cm®, which for all experimental purposes can be 
considered to be pointlike, there are still 3 x 10'° molecules. If we choose 
d°v ~ 1076 x 33, then from the Maxwell distribution 


P0) =n (si) 


in this element of jz space, there are f° d?x d?v = 10+ molecules. 


To derive the Boltzmann equation, we follow the motion of a volume 
element in ys space during the time interval [t, t+ dt]; cf. Fig. 9.1. Since those 
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Uz 


dv, 


Fig. 9.1. Deformation of a vol- 
ume element in yz space during the 
time interval dt. 


particles with a higher velocity move more rapidly, the volume element is 
deformed in the course of time. However, the consideration of the sizes of the 
two parallelepipeds? yields 


Ba! Bul = dxdv. (9.2.2) 


The number of particles at the time ¢ in d3x dv is f(x,v,t) dx dv, and 
the number of particles in the volume element which develops after the time 
interval dt is f(x + vdt,v + +Fdt,t + dt) d°z' dv’. If the gas atoms were 
collision-free, these two numbers would be the same. A change in these par- 
ticle numbers can only occur through collisions. We thus obtain 


ic tv dt,v 4 “Patt + dt) Flov,) Brdv = 
= ot) dtd?xd°v , (9.2.3) 
coll 


i.e. the change in the particle number is equal to its change due to collisions. 
The expansion of this balance equation yields 


0 1 Of 
— 2+ —F(x)Vi Vvjt)= = : 9.2.4 
E ? _ a8 m ~) | Fe . ) ot Me ( ) 
The left side of this equation is termed the flow term*. The collision term 


of) can be represented as the difference of gain and loss processes: 
coll 


Of\ 
a) =g-l. (9.2.5) 


Here, gd°z d?v dt is the number of particles which are scattered during the 
time interval dt into the volume d?z dv by collisions, and Id? x d*v dt is the 


3 The result obtained here from geometric considerations can also be derived by 
using Liouville’s theorem (L. D. Landau and E. M. Lifshitz, Course of Theoretical 
Physics, Vol. I: Mechanics, Pergamon Press, Oxford 1960, Eq. (4.6.5)). 

4 In Remark (i), p. 441, the flow term is derived in a different way. 
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number which are scattered out, i.e. the number of collisions in the volume 
element d°x in which one of the two collision partners had the velocity v 
before the collision. We assume here that the volume element d?v is so small 
in velocity space that every collision leads out of this volume element. 

The following expression for the collision term is Boltzmann’s celebrated 
Stosszahlansatz (assumption regarding the number of collisions): 


ot) = [ee d?v3 dug W(v, v2; v3, v4) [f (x, v3, t) f(x, va, b) 
coll 


— f(x, v,t) f(x, ve,t)] . (9.2.6) 


Here, W(v, V2; v3, v4) refers to the transition probability v, v2 — vs, va, 


3 8 ¥ a 4 
ped ped Fig. 9.2. Gain and loss processes, 
4 7 3 


g and | 


i.e. the probability that in a collision two particles with the velocities v and v2 
will have the velocities v3 and v4 afterwards. The number of collisions which 
lead out of the volume element considered is proportional to the number of 
particles with the velocity v and the number of particles with velocity va, 
and proportional to W(v, v2; v3, v4); a sum is carried out over all values of 
v2 and of the final velocities v3 and v4. The number of collisions in which an 
additional particle is in the volume element d*v after the collision is given by 
the number of particles with the velocities v3 and v4 whose collision yields a 
particle with the velocity v. Here, the transition probability W (v3, v4; v, v2) 
has been expressed with the help of (9.2.8e). 

The Stosszahlansatz (9.2.6), together with the balance equation (9.2.4), 
yields the Boltzmann equation 


fa) 1 
E tvVit “F(9V)| f(x, v,t) = 


[ier fabes fate W(v, vo; v3, v4) (f(x, v3, t) f(x, Va, t) 
- f(x, v,t) f(x, v2, t)) . (9.2.7) 


It is a nonlinear integro-differential equation. 
The transition probability W has the following symmetry properties: 


e Invariance under particle exchange: 


W(v, V2; V3, V4) = W(vo, V; v4, V3) - (9.2.8a) 
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e Rotational and reflection invariance: with an orthogonal matrix D we have 
W (Dv, Dv2; Dv3, Dv4) = W(v,v23 V3, V4) - (9.2.8b) 
This relation contains also inversion symmetry: 
W (—v, —v2; —v3, —v4) = W(v, va; v3, v4) . (9.2.8c) 
e Time-reversal invariance: 
W (v, V2; V3, V4) = W(—vs3, —V4; —V, —Ve) . (9.2.8d) 


The combination of inversion and time reversal yields the relation which 


we have already used in (9.2.6) for of) 


W (v3, V4; V, V2) = W(v, v2; v3, v4) - (9.2.8e) 


From the conservation of momentum and energy, it follows that 


W (v1, V2; V3, V4) = o(V¥1, V2; V3, v4) (pi + po — ps — Ps) 


2 2 2 2 
Pi , P2 P3 P3 
o o( "2m 2m Ps) pe AE) 


as one can see explicitly from the microscopic calculation of the two-particle 
collision in Eq. (9.5.21). The form of the scattering cross-section ¢ depends 
on the interaction potential between the particles. For all the general, fun- 
damental results of the Boltzmann equation, the exact form of o is not im- 
portant. As an explicit example, we calculate o for the interaction potential 
of hard spheres (Eq. (9.5.15)) and for a potential which falls off algebraically 
(problem 9.15, Eq. (9.5.29)). 

To simplify the notation, in the following we shall frequently use the 
abbreviations 


fi = f(x, vi,t) with v1 =v, 
fo = f(x, V2,t), fs = f(x, v3, t), and fa = f(x, va,t) 


Remarks: 


(9.2.9) 


(i) The flow term in the Boltzmann equation can also be derived by setting up 
an equation of continuity for the fictitious case of collision-free, non-interacting gas 
atoms. To do this, we introduce the six-dimensional velocity vector 


gee (v iSeys *) (9.2.10) 


m 


and the current density wf(x,v,t). For a collision-free gas, f fulfills the equation 
of continuity 
of 


Bet divwf =0. (9.2.11) 
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Using Hamilton’s equations of motion, Eq. (9.2.11) takes on the form 


(4 Saved: + F(x)V. faewd=0 (9.2.11’) 
of the flow term in Eqns. (9.2.4) and (9.2.7). 

(ii) With a collision term of the form (9.2.6), the presence of correlations be- 
tween two particles has been neglected. It is assumed that at each instant the 
number of particles with velocities v3 and v4, or v and vg, is uncorrelated, an 
assumption which is also referred to as molecular chaos. A statistical element is 
introduced here. As a justification, one can say that in a gas of low density, a bi- 
nary collision between two molecules which had already interacted either directly 
or indirectly through a common set of molecules is extremely improbable. In fact, 
molecules which collide come from quite different places within the gas and previ- 
ously underwent collisions with completely different molecules, and are thus quite 
uncorrelated. The assumption of molecular chaos is required only for the particles 
before a collision. After a collision, the two particles are correlated (they move 
apart in such a manner that if all motions were reversed, they would again col- 
lide); however, this does not enter into the equation. It is possible to derive the 
Boltzmann equation approximately from the Liouville equation. To this end, one 
derives from the latter the equations of motion for the single-, two-, etc. -particle 
distribution functions. The structure of these equations, which is also called the 
BBGKY (Bogoliubov, Born, Green, Kirkwood, Yvon) hierarchy, is such that the 
equation of motion for the r-particle distribution function (r = 1, 2,...) contains in 
addition also the (r + 1)-particle distribution function®. In particular, the equation 
of motion for the single-particle distribution function f(x, v,t) has the form of the 
left side of the Boltzmann equation. The right side however contains fo, the two- 
particle distribution function, and thus includes correlations between the particles. 
Only by an approximate treatment, i.e. by truncating the equation of motion for 
fe itself, does one obtain an expression which is identical with the collision term of 
the Boltzmann equation®. 

It should be mentioned that terms beyond those in the Boltzmann equation 
lead to phenomena which do not exhibit the usual exponential decay in their re- 
laxation behavior, but instead show a much slower, algebraic behavior; these time 
dependences are called “long time tails”. Considered microscopically, they result 
from so called ring collisions; see the reference by J. A. McLennan at the end of this 
chapter. Quantitatively, these effects are in reality immeasurably small; up to now, 
they have been observed only in computer experiments. In this sense, they have 
a similar fate to the deviations from exponential decay of excited quantum levels 
which occur in quantum mechanics. 

(iii) To calculate the collision time 7, we imagine a cylinder whose length is 
equal to the distance which a particle with thermal velocity travels in unit time, 
and whose basal area is equal to the total scattering cross-section. An atom with 
a thermal velocity passes through this cylinder in a unit time and collides with all 
the other atoms within the cylinder. The number of atoms within the cylinder and 
thus the number of collisions of an atom per second is otott¥n, and it follows that 
the mean collision time is 

1 


Otot UN 


(9.2.12) 


a 


° The r-particle distribution function is obtained from the N-particle distribution 


function by means of f,(X1,V1,...Xr,Vr,t) = Woy f Prri@rpidand un 
p(X1,V1,-..Xn,Vwn,t). The combinatorial factor results from the fact that it is 


not important which of the particles is at the u-space positions x1, V1,.... 
® See references at the end of this chapter, e.g. K. Huang, S. Harris. 
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The mean free path | is defined as the distance which an atom typically travels 
between two successive collisions; it is given by 


1 


OtotN 


(9.2.13) 


l=vrT= 


(iv) Estimates of the lengths and times which play a role in setting up the 
Boltzmann equation: the range r. of the potential must be so short that collisions 
occur between only those molecules which are within the same volume element 
d?x: re < dx. This inequality is obeyed for the numerical example r, ~ 107° cm, 
dz = 10-°cm. With 0 = 10°, we obtain for the time during which the particle 


ade tN -3 = ; baie ts 
is within d°x the value tas, * oom s 10~*sec. The duration of a collision is 


sec 


-8 2 es ; ao = 
tT & Dos » 10 83sec, the collision time tT & (r2nv)~* x» (107'%cem? x 3 x 


sec 
10'°cm7? x 10°emsec™')~! & 3 x 107°sec. 


9.3 Consequences of the Boltzmann Equation 


9.3.1 The H-Theorem’ and Irreversibility 


The goal of this section is to show that the Boltzmann equation shows irre- 
versible behavior, and the distribution function tends towards the Maxwell 
distribution. To do this, Boltzmann introduced the quantity H, which is re- 
lated to the negative of the entropy: ” 


(x,t) = [ee f(x, v, t) log f(x, v,t) . (9.3.1) 


For its time derivative, one obtains from the Boltzmann equation (9.2.7) 


(x,t) 


[eed tens 


I 


- [eva + log f) Gz + “FV. ) f-I (9.3.2) 
= -V. [ dv (Flog f) v—I. 


The second term in the large brackets in the second line is proportional 
to f[ d’v V.(f log f) and vanishes, since there are no particles with infinite 
velocities, i.e. f — 0 for v — ov. 


” Occasionally, the rumor makes the rounds that according to Boltzmann, this 
should actually be called the Eta-Theorem. In fact, Boltzmann himself (1872) 
used F& (for entropy), and only later (S.H. Burbury, 1890) was the Roman letter 
H adopted (D. Flamm, private communication, and S.G. Brush, Kinetic Theory, 
Vol. 2, p. 6, Pergamon Press, Oxford, 1966). 
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The contribution of the collision term 
[= / vy Aus dug du, W(vi, V2; V3; va)(fi fe = fafa) + log fi) (9.3.3) 


is found by making use of the invariance of W with respect to the exchanges 
1,3 << 2,4 and 1,2 — 3,4 to be 


l= if an Pus d?u3 du, W (v1, V2; V3, v4) (fi fo— fs fa) log a2 : (9.3.4) 
3J4 


The rearrangement which leads from (9.3.3) to (9.3.4) is a special case of the 
general identity 


dv, Bug Bus du, W (v1, V2; V3, va) (fifo — fafa)yr 
1 
=e i Pv, dvz d3v3 d3v4, W (v1, V2; V3, V4) X 


x (fife — fafa)(~1 + 2 -— 3 — pa), (9.3.5) 


which follows from the symmetry relations (9.2.8), and where y; = v(x, vi, t) 
(problem 9.1). 
From the inequality (a — y) log 7 = 9, it follows that 


I>0. (9.3.6) 


The time derivative of H, Eq. (9.3.2), can be written in the form 


(x,t) = —V2ju(x,t) —T, (9.3.7) 
where 
jz = [@e floss Vv (9.3.8) 


is the associated current density. The first term on the right-hand side of 
(9.3.7) gives the change in H due to the entropy flow and the second gives 
its change due to entropy production. 


Discussion: 


a) If no external forces are present, F(x) = 0, then the simplified situation 
may occur that f(x,v,t) = f(v,t) is independent of x. Since the Boltzmann 
equation then contains no x-dependence, f remains independent of position 
for all times and it follows from (9.3.7), since Vija(x,t) =0, that 


H=-I<0. (9.3.9) 
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The quantity H decreases and tends towards a minimum, which is finite, 
since the function flog f has a lower bound, and the integral over v exists.® 
At the minimum, the equals sign holds in (9.3.9). In Sect. 9.3.3, we show that 
at the minimum, f becomes the Maxwell distribution 


3/2 aad 
f(v) =n () ew EF (9.3.10) 

b) When F(x) 4 0, and we are dealing with a closed system of volume 
V, then 


| Ba Vaju(x,t) =} dO jn(x,t) =0 
4 O(V) 


holds. The flux of H through the surface of this volume vanishes if the surface 
is an ideal reflector; then for each contribution v dO there is a corresponding 
contribution —v dO, and it follows that 


5 Ho = 5 [ bens) = - [ eer =U; (9.3.11) 
Ao, decreases, we have irreversibility. The fact that irreversibility fol- 
lows from an equation derived from Newtonian mechanics, which itself 
is time-reversal invariant, was met at first with skepticism. However, the 
Stosszahlansatz contains a probabilistic element, as we will demonstrate in 
detail following Eq. (9.3.14). 

As already mentioned, H is closely connected with the entropy. The cal- 
culation of H for the equilibrium distribution f°(v) for an ideal gas yields 


(see problem 9.3) H =n [log (n (str) ””) _ 3] . The total entropy S of the 
ideal gas (Eq. (2.7.27)) is thus 


S=-VkH —kN (3 log oa = 1) , (9.3.12a) 
m 


Here, fis Planck’s quantum of action. Expressed locally, the relation between 
the entropy per unit volume, H, and the particle number density n is 


S(x,t) = —kH(x,t)—k (s log an - 1) n(x, t) . (9.3.12b) 


8 One can readily convince oneself that H(t) cannot decrease without limit. Due 
to f@v f(x,v,t) < oo, f(x,v,t) is bounded everywhere and a divergence of 
H(t) could come only from the range of integration v — oo. For v > oo, f > 0 
must hold and as a result, log f — —oo. Comparison of H(t) = [@ouf log f 
with [ d>v vu" f(x, v,t) < 00 shows that a divergence requires |log f| > v?. Then, 


Uv 


however, f <e ” , and H remains finite. 
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The associated current densities are 


nh 
js(x,t) = —kju(x,t) —k (3 log = = 1) j(x,t) (9.3.12c) 
m 
and fulfill 
S(x,t) = —Vjs(x,t)+kI. (9.3.12d) 


Therefore, kI has the meaning of the local entropy production. 


*9.3.2 Behavior of the Boltzmann Equation under Time Reversal 


In a classical time-reversal transformation T (also motion reversal), the mo- 
menta (velocities) of the particles are reversed (v — —v)%. Consider a 
system which, beginning with an initial state at the positions x,,(0) and 
the velocities v,(0), evolves for a time t, to the state {x,,(t),vn(t)}, then 
at time t; experiences a motion-reversal transformation {xn(t1),Vn(t1)} > 
{xn(t1), —Vn(ti)}; then if the system is invariant with respect to time re- 
versal, the further motion for time t, will lead back to the motion-reversed 
initial state {x,(0), —vn(0)}. The solution of the equations of motion in the 
second time period (t > t1) is 


 (t) = x(2t1 — t) (9.3.13) 
Here, we have assumed that no external magnetic field is present. Apart from 


a translation by 2t,, the replacement t — —t,v — —v is thus made. Under 
this transformation, the Boltzmann equation (9.2.7) becomes 


(Gtwx+ “F(0)Vy) f(x, -—v, -—t) = —I[f(x,-v,-t)] . (9.3.14) 
The notation of the collision term should indicate that all distribution func- 
tions have the time-reversed arguments. The Boltzmann equation is therefore 
not time-reversal invariant; f(x,—v, —t) is not a solution of the Boltzmann 
equation, but instead of an equation which has a negative sign on its right- 
hand side (—J [ f(x, —v, —t)])). 

The fact that an equation which was derived from Newtonian mechan- 
ics, which is time-reversal invariant, is itself not time-reversal invariant and 
exhibits irreversible behavior (Eq. (9.3.11)) may initially appear surprising. 
Historically, it was a source of controversy. In fact, the Stosszahlansatz con- 
tains a probabilistic element which goes beyond Newtonian mechanics. Even 
if one assumes uncorrelated particle numbers, the numbers of particles with 


° See e.g. QM II, Sect. 11.4.1 
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the velocities v and v2 will fluctuate: they will sometimes be larger and some- 
times smaller than would be expected from the single-particle distribution 
functions f; and f2. The most probable value of the collisions is fy - fo, and 
the time-averaged value of this number will in fact be fi - fo. The Boltzmann 
equation thus yields the typical evolution of typical configurations of the 
particle distribution. Configurations with small statistical weights, in which 
particles go from a (superficially) probable configuration to a less probable 
one (with lower entropy) — which is possible in Newtonian mechanics — are 
not described by the Boltzmann equation. We will consider these questions in 
more detail in the next chapter (Sect. 10.7), independently of the Boltzmann 
equation. 


9.3.3 Collision Invariants and the Local Maxwell Distribution 


9.3.3.1 Conserved Quantities 


The following conserved densities can be calculated from the single-particle 
distribution function: the particle-number density is given by 


n(x,t) = pees. (9.3.15a) 


The momentum density, which is also equal to the product of the mass and 
the current density, is given by 


mj(x,t) = mn(x,t)u(x,t) = m { ®ovs : (9.3.15b) 


Equation (9.3.15b) also defines the average local velocity u(x,t). Finally, we 
define the energy density, which is composed of the kinetic energy of the local 
convective flow at the velocity u(x,t), ic. n(x, t)mu(x, t)?/2, together with 
the average kinetic energy in the local rest system’, n(x, t)e(x, t): 


t 2 2 
n(x, t) mae” + e(x,) = ee She feos (u? + ¢°) f ; 
(9.3.15c) 
Here, the relative velocity @ = v—u has been introduced, and f[ d*v df = 0, 
which follows from Eq. (9.3.15b), has been used. For e(x,t), the internal 


energy per particle in the local rest system (which is moving at the velocity 
u(x, t)), it follows from (9.3.15c) that 


n(x, t) =F feu (v —u(x,t))?f. (9.3.15c’) 


0 We note that for a dilute gas, the potential energy is negligible relative to the 
kinetic energy, so that the internal energy per particle e(x,t) = €(x,t) is equal 
to the average kinetic energy. 
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9.3.3.2 Collisional Invariants 


The collision integral J of Eq. (9.3.3) and the collision term in the Boltzmann 
equation vanish if the distribution function f fulfills the relation 


fife — ffs =0 (9.3.16) 


for all possible collisions (restricted by the conservation laws contained in 
(9.2.8f), ie. if 


log fi + log fo = log fz + log fa (9.3.17) 


holds. Note that all the distribution functions f; have the same x-argument. 
Due to conservation of momentum, energy, and particle number, each of the 
five so called collisional invariants 


x? = my; , i=1,2,3 (9.3.18a) 
2 

aes (9.3.18b) 

x=) (9.3.18c) 


obeys the relation (9.3.17). There are no other collisional invariants apart 
from these five!!. Thus the logarithm of the most general distribution function 
for which the collision term vanishes is a linear combination of the collisional 
invariants with position-dependent prefactors: 


log f£(x, v, t) = a(x, t) + B(x, t) (u(x, t)- mv — av?) (9.3.19) 


or 


F(av.t) = nla) (sam 2 sary y — en] . 
(9.3.19’) 


3 
Here, the quantities T(x, t) = (kG(x,t))~1, n(x,t) = (35) : exp |a(x, t) 


+(x, t)mu? (x, t)/2] and u(x,t) represent the local temperature, the local 
particle-number density, and the local velocity. One refers to f(x, v,t) as the 
local Maxwell distribution or the local equilibrium distribution function, since 
it is identical locally to the Maxwell distribution, (9.3.10) or (2.6.13). If we 
insert (9.3.19) into the expressions (9.3.15a-c) for the conserved quantities, 
we can see that the quantities n(x,t), u(x,t), and T(x, t) which occur on the 
right-hand side of (9.3.19’) refer to the local density, velocity, and tempera- 
ture, respectively, with the last quantity related to the mean kinetic energy 
via 


4H. Grad, Comm. Pure Appl. Math. 2, 331 (1949). 
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3 
e(x,t) = aT (x, t) F 


i.e. by the caloric equation of state of an ideal gas. 

The local equilibrium distribution function f’(x, v,t) is in general not a 
solution of the Boltzmann equation, since for it, only the collision term but 
not the flow term vanishes!?. The local Maxwell distribution is in general a 
solution of the Boltzmann equation only when the coefficients are constant, 
i.e. in global equilibrium. Together with the results from Sect. 9.3.1, it follows 
that a gas with an arbitrary inhomogeneous initial distribution f(x, v,0) will 
finally relax into a Maxwell distribution (9.3.10) with a constant temperature 
and density. Their values are determined by the initial conditions. 


9.3.4 Conservation Laws 


With the aid of the collisional invariants, we can derive equations of continuity 
for the conserved quantities from the Boltzmann equation. We first relate the 
conserved densities (9.3.15a—c) to the collisional invariants (9.3.18a-c). The 
particle-number density, the momentum density, and the energy density can 
be represented in the following form: 


n(x,t) = pee xf , (9.3.20) 

m ji(x,t) = mn(x, t)ui(x, t) = [ee San ae (9.3.21) 
and 

n(x, t) ae + a(x.) = pee bak ie (9.3.22) 


Next, we want to derive the equations of motion for these quantities from the 
Boltzmann equation (9.2.7) by multiplying the latter by x*(v) and integrat- 
ing over v. Using the general identity (9.3.7), we find 


[ee x°(v) E +vVat+ =F(9V)| f(x,v,t) =0. (9.3.23) 


By inserting x°, y'??, and y+ in that order, we obtain from (9.3.23) the 
following three conservation laws: 


12 There are special local Maxwell distributions for which the flow term likewise van- 
ishes, but they have no physical relevance. See G. E. Uhlenbeck and G. W. Ford, 
Lectures in Statistical Mechanics, American Mathematical Society, Providence, 
1963, p. 86; S. Harris, An Introduction to the Theory of the Boltzmann Equation, 
Holt Rinehart and Winston, New York, 1971, p. 73; and problem 9.16. 
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Conservation of Particle Number: 


a — 
ant Vi=0. (9.3.24) 


Conservation of Momentum: 


moj, + Vz, [omens — Fi(x)n(x) =0. (9.3.25) 


For the third term, an integration by parts was used. If we again employ the 
substitution v = u — ¢ in (9.3.25), we obtain 


0. 6) 
maj = ra + Pi) = nF; , (9.3.25’) 


where we have introduced the pressure tensor 
Py = Py = m [av Vids f - (9.3.26) 


Conservation of Energy: 


Finally, setting x4 = “& in (9.3.23), we obtain 


a Bo Sv ftVe, | Pv (utdi)> (w+2ujdj+¢")f—F-F = 0, (9.3.27) 


where an integration by parts was used for the last term. Applying (9.3.22) 
and (9.3.26), we obtain the equation of continuity for the energy density 


O m m : 

at fae a °)| eve [rus ( gt e) + ujPi +a | =J-F . (9.3.28) 
Here, along with the internal energy density e defined in (9.3.15c’), we have 
also introduced the heat current density 


qs [evo (Fe?) f. (9.3.29) 


Remarks: 


(i) (9.3.25’) and (9.3.28) in the absence of external forces (F = 0) take on 
the usual form of equations of continuity, like (9.3.24). 

(ii) In the momentum density, according to Eq. (9.3.25’), the tensorial cur- 
rent density is composed of a convective part and the pressure tensor 
P,;, which gives the microscopic momentum current in relation to the 
coordinate system moving at the average velocity u. 
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(iii) The energy current density in Eq. (9.3.28) contains a macroscopic con- 
vection current, the work which is performed by the pressure, and the 
heat current q (= mean energy flux in the system which is moving with 
the liquid). 

(iv) The conservation laws do not form a complete system of equations as 
long as the current densities are unknown. In the hydrodynamic limit, 
it is possible to express the current densities in terms of the conserved 
quantities. 


The conservation laws for momentum and energy can also be written as 
equations for u and e. To this end, we employ the rearrangement 


0 0 
api + Vj(nujui) = na vi + wan + uiVjnuz + nuz Vj ui 
6) 


using (9.3.21) and the conservation law for the particle-number density 
(9.3.21), which yields for (9.3.25’) 


6) 
From this, taking the hydrodynamic limit, we obtain the Navier-Stokes equa- 
tions. Likewise, starting from Eq. (9.3.28), we can show that 


0 
n (5 + WV) e+ Vq= —PiViuj ‘ (9.3.32) 


9.3.5 Conservation Laws and Hydrodynamic Equations for the 
Local Maxwell Distribution 


9.3.5.1 Local Equilibrium and Hydrodynamics 


In this section, we want to collect and explain some concepts which play a 
role in nonequilibrium theory. 

The term local equilibrium describes the situation in which the thermody- 
namic quantities of the system such as density, temperature, pressure, etc. can 
vary spatially and with time, but in each volume element the thermodynamic 
relations between the values which apply locally there are obeyed. The result- 
ing dynamics are quite generally termed hydrodynamics in condensed-matter 
physics, in analogy to the dynamic equations which are valid in this limit for 
the flow of gases and liquids. The conditions for local equilibrium are 


wr<1 and kl<l, (9.3.33) 
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where w is the frequency of the time-dependent variations and k their 
wavenumber, 7 is the collision time and / the mean free path. The first con- 
dition guarantees that the variations with time are sufficiently slow that the 
system has time to reach equilibrium locally through collisions of its atoms. 
The second condition presumes that the particles move along a distance | 
without changing their momenta and energies. The local values of momen- 
tum and energy must therefore in fact be constant over a distance J. 

Beginning with an arbitrary initial distribution function f(x,v,0), ac- 
cording to the Boltzmann equation, the following relaxation processes oc- 
cur: the collision term causes the distribution function to approach a local 
Maxwell distribution within the characteristic time 7. The flow term causes 
an equalization in space, which requires a longer time. These two approaches 
towards equilibrium — in velocity space and in configuration space — come 
to an end only when global equilibrium has been reached. If the system is 
subject only to perturbations which vary slowly in space and time, it will be 
in local equilibrium after the time 7. This temporally and spatially slowly 
varying distribution function will differ from the local Maxwellian function 
(9.3.19’), which does not obey the Boltzmann equation. 


9.3.5.2 Hydrodynamic Equations without Dissipation 


In order to obtain explicit expressions for the current densities q and P,;, 
these quantities must be calculated for a distribution function f(x, v,t) which 
at least approximately obeys the Boltzmann equation. In this section, we will 
employ the local Maxwell distribution as an approximation. In Sect. 9.4, the 
Boltzmann equation will be solved systematically in a linear approximation. 

Following the preceding considerations concerning the different relaxation 
behavior in configuration space and in velocity space, we can expect that in 
local equilibrium, the actual distribution function will not be very different 
from the local Maxwellian distribution. If we use the latter as an approxima- 
tion, we will be neglecting dissipation. 

Using the local Maxwell distribution, Eq. (9.3.19), 


=) 3 = m(v — u(x, a) (9.3.34) 


with position- and time-dependent density n, temperature J, and flow veloc- 
ity u, we find from (9.3.15a), (9.3.15b), and (9.3.15c’) 


f= n(x,t) 


j=nu (9.3.35) 


ne = SnkT (9.3.36) 


P,; = pee moib; fo = b;jnkT = bi5P , (9.3.37) 
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where the local pressure P was introduced; from (9.3.37), it is given by 
P=nkT. (9.3.38) 


The equations (9.3.38) and (9.3.36) express the local thermal and caloric 
equations of state of the ideal gas. The pressure tensor P;; contains no dissi- 
pative contribution which would correspond to the viscosity of the fluid, as 
seen from Eq. (9.3.37). The heat current density (9.3.29) vanishes (q = 0) 
for the local Maxwell distribution. 

With these results, we obtain for the equations of continuity 
(9.3.24), (9.3.25’), and (9.3.32) 


3) 
ri —Vnu (9.3.39) 
mn (5 +uv) u=—VP+nF (9.3.40) 
0 
n (3 + uv) e=-—PVu. (9.3.41) 


Here, (9.3.40) is Euler’s equation, well-known in hydrodynamics!3. The equa- 
tions of motion (9.3.39)—(9.3.41) together with the local thermodynamic re- 
lations (9.3.36) and (9.3.38) represent a complete system of equations for n, 
u, and e. 


9.3.5.3 Propagation of Sound in Gases 


As an application, we consider the propagation of sound. In this process, the 
gas undergoes small oscillations of its density n, its pressure P, its internal 
energy e, and its temperature JT around their equilibrium values and around 
u = 0. In the following, we shall follow the convention that thermodynamic 
quantities for which no position or time dependence is given are taken to 
have their equilibrium values, that is we insert into Eqns. (9.3.39)—(9.3.41) 


n(x, t) = n+ 6n(x, t), P(x,t) = P+ 6P(x,t), 


9.3.42 
e(x,t) =e+ de(x,t), T (x,t) =T + 6T (x,t) ( ) 
and expand with respect to the small deviations indicated by 6: 

6) 

aor =—-nVu (9.3.43a) 

mnou = —V6P (9.3.43b) 
6) 

ng de =—PVu. (9.3.43c) 


13 Buler’s equation describes nondissipative fluid flow; see L.D. Landau and 
E.M. Lifshitz, Course of Theoretical Physics, Vol. IV: Hydrodynamics, Perg- 
amon Press, Oxford 1960, p. 4. 
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The flow velocity u(x,t) = du(x,t) is small. Insertion of Eq. (9.3.36) and 
(9.3.38) into (9.3.43c) leads us to 


which, together with (9.3.43a), yields 


2 E | =A (9.3.44) 


atln 27 


Comparison with the entropy of an ideal gas, 


(9.3.45) 


5 (2mmkT)3/2 
S=kN + | ; 
(5 08 nh3 
shows that the time independence of S/N (i.e. of the entropy per particle or 
per unit mass) follows from (9.3.44). By applying 0/0t to (9.3.43a) and V 
to (9.3.43b) and eliminating the term containing u, we obtain 


d76n 
Ot? 
It follows from Eq. (9.3.38) that 


=m !V°oP. (9.3.46) 


5P = nkoT + 6nkT , 


and, together with (9.3.44), we obtain 26P = 3kTZ6n. With this, the 
equation of motion (9.3.46) can be brought into the form 
O56P — 5kT 
at2 3m 


VORP. (9.3.47) 


The sound waves (pressure waves) which are described by the wave equa- 
tion (9.3.47) have the form 


6P« eilkx£cs|k|t) (9.3.48) 


with the adiabatic sound velocity 


/ 1 /5kT 
es= = as ; (9.3.49) 
mnks 3m 


Here, Kg is the adiabatic compressibility (Eq. (3.2.3b)), which according to 
Eq. (3.2.28) is given by 


3 3V 


= 5p BNET (9.3.50) 


KS 


for an ideal gas. 
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Notes: 


The result that the entropy per particle S/N or the entropy per unit mass s 
for a sound wave is time-independent remains valid not only for an ideal gas 
but in general. If one takes the second derivative with respect to time of the 


following thermodynamic relation which is valid for local equilibrium! 
On On S 
én = | — oP ——]} o|— 9.3.51 
= (55) sy? * (are) oC) aa 
2 
ws O26n _ (On a®P én_\ 9 S/N ; 
obtaining 7 = (3B) s/w op + (x8), a then one obtains togther 
=0 
with (9.3.43a) and (9.3.43b) the result 
Ohi OP 
OER) gt (CE) Gay: (9.3.52) 
Ot? On} sin 


which again contains the adiabatic sound velocity 


Zh oP —m-) aP 
i An} gin ON/V/) 5 
OP 1 
— m-ln-l 2 3 
=m ~N-“( v9 (5) = mae 


Following the third equals sign, the particle number N was taken to be fixed. 

For local Maxwell distributions, the collision term vanishes; there is no 
damping. Between the regions of different local equilibria, reversible oscilla- 
tion processes take place. Deviations of the actual local equilibrium distribu- 
tion functions f(x,v,t) from the local Maxwell distribution f'(x,v,t) lead 
as a result of the collision term to local, irreversible relaxation effects and, 
together with the flow term, to diffusion-like equalization processes which 
finally result in global equilibrium. 


(9.3.53) 


*9.4 The Linearized Boltzmann Equation 


9.4.1 Linearization 


In this section, we want to investigate systematically the solutions of the 
Boltzmann equation in the limit of small deviations from equilibrium. The 
Boltzmann equation can be linearized and from its linearized form, the hy- 
drodynamic equations can be derived. These are equations of motion for 
the conserved quantities, whose region of validity is at long wavelengths and 


‘4 Within time and space derivatives, dn(x,t), etc. can be replaced by n(x, t) etc. 
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low frequencies. It will occasionally be expedient to use the variables (k, w) 
(wavenumber and frequency) instead of (x,t). We will also take an external 
potential, which vanishes for early times, into account: 


jim V(x,t) =0. (9.4.1) 


Then the distribution function is presumed to have the property 


slim fox,vyt) = Pv) an (ay) et 


Tt ee (9.4.2) 


where f° is the global spatially uniform Maxwellian equilibrium distribu- 
tion’. 

For small deviations from global equilibrium, we can write f(x,v,t) in 
the form 


f(x,v,t) = f°(v) (1 + me.) = fo+of (9.4.3) 


and linearize the Boltzmann equation in df or v. The linearization of the 
collision term (9.2.6) yields 


ot) = =f vg avn dog WPI PBI +F9 Ba f8 fi L985 fs— 2 Sf) 


1 
= fem dz du, W(v v1; v3va) f0(v1) f° (v2) (vitv2—Vv3—4) , 
(9.4.4) 
since f$f? = fPf2 owing to energy conservation, which is contained in 


W(vvi;v3v4). We also use the notation v; = v, f? = f°(v) etc. The flow 
term has the form 


2) 1 fe 
E +vVa+ =F(«.)V>| G + =) 


kT 
6a 
~ oP | 


o + vv, v(x,v,t) +v-(WV(x,t)) fo(v)/kT . (9.4.5) 


All together, the linearized Boltzmann equation is given by: 


E + vv, v(x, v,t) +v(VV (x, t)) = —Lv (9.4.6) 


15 We write here the index which denotes an equilibrium distribution as an upper 
index, since later the notation f? = f°(v;) will also be employed. 
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with the linear collision operator L: 
Lv = aw) / dv, dv3 du, W' (v, v2; V3, Va) (V + V2 — V3 — V4) (9.4.7) 
and 
W" (w v2; vs-va) = pos(F9(w) (v2) f°(vs) F(wa)) EW (v v2; vs va)» (9-48) 
where conservation of energy, contained in W, has been utilized. 


9.4.2 The Scalar Product 


For our subsequent investigations, we introduce the scalar product of two 
functions ~(v) and x(v), 


f°) 
kT 


Wh) = / Bo yy ww) ; (9.4.9) 


it possesses the usual properties. The collisional invariants are special cases: 


0 Vv nr 
(Ele) = (alt) = fae TW) (9.4.10a) 
mv? 0 Vv nr 
(x4|x°) = (ell) = f av 5 FW) = 7 = sn (9.4.10b) 
with «= my" and 
2 
Odlx*) = (ele) = [ee () ee = onkT (9.4.10c) 


The collision operator £ introduced in (9.4.7) is a linear operator, and obeys 
the relation 


1 
(y|Lv) = 7 i, Av, dv, v3 dv, W' (v1 V2; V3 V4) 
x (+2 —v3—m)(x' +x? -x?-x4). (9.4.11) 
It follows from this that L is self-adjoint and positive semidefinite, 


(x|Lv) = (£x\v) , (9.4.12) 
(v|£Lv) >0. (9.4.13) 
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9.4.3 Eigenfunctions of C and the Expansion of the Solutions of 
the Boltzmann Equation 


The eigenfunctions of £ are denoted as x* 
Ly? = wx ; Ww, >0. (9.4.14) 


The collisional invariants y1, x”, x?, y*, x° are eigenfunctions belonging to 
the eigenvalue 0. 
It will prove expedient to use orthonormalized eigenfunctions: 


(Ole) =e (9.4.15) 


For the collisional invariants, this means the introduction of 


gayua Ten = Tain  SSno4- (9.4.16a) 
(uj|ui) = 5 | ae v? f(v)/kT (here not summed over 7) ; 
ee (9.4.16b) 
Vl) nf kT 
=x = eae aoe. (9.4.16c) 


J (AI) (Cat) fele) = (1le)?) $n 


The eigenfunctions y* with w) > 0 are orthogonal to the functions (9.4.16a— 
c) and in the case of degeneracy are orthonormalized among themselves. An 
arbitrary solution of the linearized Boltzmann equation can be represented as 
a superposition of the eigenfunctions of £ with position- and time-dependent 
prefactors!® 


v(x, v,t) = a°(x, t)X” +a4(x, t)X? +a" (x,t em Sa (x, t)X¥* . (9.4.17) 


Here, the notation indicates the particle-number density n(x, t), the temper- 
ature T(x, t), and the flow velocity u,(x, t): 


Pot) = al(x,t) = (xb) = f dv og et = f dvss(x,v.ox" 


_ oe — ae 


T( 4.1 
a" OT x,t). (9.4.18a) 


16 Here we assume that the eigenfunctions oe form a complete basis. For the explic- 
itly known eigenfunctions of the Maxwell potential (repulsive r—4 potential), this 
can be shown directly. For repulsive r~” potentials, completeness was proved by 
Y. Pao, Comm. Pure Appl. Math. 27, 407 (1974). 
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The identification of 6T (x, t) with local fluctuations of the temperature, apart 
from the normalization factor, can be justified by considering the local inter- 
nal energy 


e+de= S(n-+in)K(T + 62) , 


from which, neglecting second-order quantities, it follows that 


de — zOnkT 


3 3 
de = snkoT + skTon => T= Si (9.4.19) 
Similarly, we obtain for 
én 
t) = a*(x,t) = (x"|v) = | @Budf(x,v,t) 
Aes) = aad) =m) = | Podsev.) 7 = Te 
(9.4.18b) 


= f Poe + on = i=1,2,3.  (9.4.18c) 
N/M 


These expressions show the relations to the density and momentum fluctu- 
ations. We now insert the expansion (9.4.17) into the linearized Boltzmann 
equation (9.4.6) 


[oe) 


( 2 + wv) v(x,v,t) =— S> a® (x, twyX* (v) — VV (x,t) . (9.4.20) 


r/=6 


Only terms with \’ > 6 contribute to the sum, since the collisional invariants 
have the eigenvalue 0. Multiplying this equation by ~? f°(v)/kT and integrat- 
ing over v, we obtain, using the orthonormalization of ¥* from Eq. (9.4.15), 


0 Xr = aN AN! a 
vi Gt) + D(x Ive’) a (x, ) 
= —w a(x, t) — (%*|v) VV (x,t). (9.4.21) 


Fourier transformation 


a(x, t) = / (a)? x eilk-x—wt) (Ik, w) (9.4.22) 
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(w+ iwy)a*(k,w) —k (lve) a (k,w) — k (Q|v) V(k,w) = 0. 
N=1 
(9.4.23) 


Which quantities couple to each other depends on the scalar products 
(lve), whereby the symmetry of the ¥% clearly plays a role. 

Since w, = 0 for the modes \ = 1 to 5, i.e. momentum, energy, and 
particle-number density, the structure of the conservation laws for these quan- 
tities in (9.4.23) can already be recognized at this stage. The term containing 
the external force obviously couples only to ¥' = ¥“‘ for reasons of symmetry 


si ait AOE es 9.4.24 
(Xe) ul Vn/ (9.4.24) 


For the modes with \ < 5, 


war(k,w) —k 57 (lve) a (k,w) —k (ZA|v) V(k,w) =0 (9.4.25) 
N=1 


holds, and for the non-conserved degrees of freedom!” \ > 6, we have 


a*(k,w) = hi (>: Cauran) a (k,w) 


+ > (x 10:e"") a (ew) + (iu) Vee) . (9.4.26) 


»’/=6 


This difference, which results from the different time scales, forms the 
basis for the elimination of the non-conserved degrees of freedom. 


9.4.4 The Hydrodynamic Limit 


For low frequencies (w < w*) and (vk < w%), a(k,w) with \ > 6 is of 
higher order in these quantities than are the conserved quantities \ = 1,...,5. 
Therefore, in leading order we can write for (9.4.26) 


: 5 
a*(k,w) = a (> (x |i’) a (k,w) + (Xvi) i) . (9.4.27) 


ae 
Inserting this into (9.4.25) for the conserved (also called the hydrodynamic) 
variables, we find 


'7 Here, the Einstein summation convention is employed: repeated indices i,j, 1,7 
are to be summed over from 1 to 3. 
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w) — ky ‘: (x lv.2"') a® (k,w) 


N=1 
+ ikiky So Quix") a (a es) a (i) — be (RA) V(k, ) 
N=1 p=6 


— ki 3 (x 1vie*’) (=) (x ey) V(k,w) =0; (9.4.28) 
N=6 


this is a closed system of hydrodynamic equations of motion. The second 
term in these equations leads to motions which propagate like sound waves, 
the third term to damping of these oscillations. The latter results formally 
from the elimination of the infinite number of non-conserved variables which 
was possible due to the separation of the time scales of the hydrodynamic 
variables (typical frequency ck, Dk?) from the that of the non-conserved 
variables (typical frequency w, « 77'). 

The structure which is visible in Eq. (9.4.28) is of a very general nature 
and can be derived from the Boltzmann equations for other physical systems, 
such as phonons and electrons or magnons in solids. 

Now we want to further evaluate Eq. (9.4.28) for a dilute gas without the 
effect of an external potential. We first compute the scalar products in the 
second term (see Eqns. (9.4.16a—c)) 


x ei f°(v) VjV5 kT 
™"axi) = | du j__s— §,, 9.4.29a 
(eleai)= | bo rete = bY es 
mv — 3kT 
2kT 
(x7 |v: x’) )= fart fly w),,, = 6i4/ >— - (9.4.29b) 
2 3nkT a 


These scalar products and ({7|vjx"7) = (XT |v;X7) are the only finite 
scalar products which result from the flow term in the equation of motion. 

We now proceed to analyze the equations of motion for the particle- 
number density, the energy density, and the velocity. In the equation of 
motion for the particle-number density, \ = 5 (9.4.28), there is a coupling to 
a'(k,w) due to the second term. As noted above, all the other scalar products 
vanish. The third term vanishes completely, since (Y"|vix") a (u|X") = 0 
for > 6 owing to the orthonormalization. We thus find 


kT ., 
wi(k,w) — ki4/ —a'(k,w) =0, (9.4.30) 
m 


or, due to (9.4.18), 
wén(k, wv) — knu'(k,w) =0, (9.4.30) 


or in real space 
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O 
ras t)+ Vnu(x,t)=0. (9.4.30”) 
This equation of motion is identical with the equation of continuity for the 
density, (9.3.24), except that here, n(x, t) in the gradient term is replaced by 
n because of the linearization. 


The equation of motion for the local temperature, making use of (9.4.28), 
(9.4.18a), and (9.4.29b), can be cast in the form 


3n 2kT nuj(k, w) 
——kéT(k ky 
Wart oe) 3m ai 


+ ikjk; s; x 4a, ee) = (xe) a® (k,w) =0. (9.4.31) 


N=1 p=6 


In the sum over 2’, the term \’ = 5 makes no contribution, since (X""|vjX°) « 
(X“|v;) = 0. Due to the fact that 4 transforms as a scalar, ~“ must transform 
like v;, so that due to the second factor, al = ¥' also makes no contribution, 
leaving only gy = ¥*. Finally, only the following expression remains from 
the third term of Eq. (9.4.31): 


Bae a ee ae ee 
ikiky 5) (X*|uix") — (xt lujX") ah(k, w) 
= LL 


ww ikikyt 5” (x4 |viX") (X" [vj X74) a4 (k, w) 
uU=6 
5 
= ikskyr ((x*foivsxt) — D> Cetloux?) (0 lay) aw) 
A=1 


= ikgky7 ((X4|vivj x4) — (4 |vix") (Xu; X*))a*(k,w) . (9.4.32) 


In this expression, all the w7' were replaced by the collision time, w7! = 7, 
and we have employed the completeness relation for the eigenfunctions of L 


as well as the symmetry properties. We now have 


(x4 |aiX*) = as (9.4.33a) 


where here, we do not sum over 7, and 


2 
mgt)" — your + (Sar)? 
Sn(kT)? 


Giant) aay! feo sroreeh 


kT 
a = by _ (9.4.33b) 
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Thus the third term in Eq. (9.4.31) becomes ik?D,/3n/2kTk6T, with the 
coefficient 


5 kT r K 
3 m My ’ (34) 
where 
3 
Cy = gtk (9.4.35) 


is the specific heat at constant volume, and 
52 
k= gtk TT (9.4.36) 


refers to the heat conductivity. All together, using (9.4.32)—(9.4.34), we obtain 
for the equation of motion (9.4.31) of the local temperature 


[/QkT , 
wa*(k, w) — ky Sar (kw) + ik? Da*(k,w) =0, (9.4.37) 
mm 
or 
2T 


oT — 
. 3n 


k-nu+ik?DédT =0, (9.4.37') 


or in real space, 


0 2T 
—T(x,t)+ — 
Ot v0) 3n 
Connection with phenomenological considerations: 
The time variation of the quantity of heat dQ is 


Vnu(x,t) — DV?T(x,t) =0. (9.4.37”) 


6Q = —Via (9.4.38a) 
with the heat current density jg. In local equilibrium, the thermodynamic relation 
6Q = cpdbT (9.4.38b) 


holds. Here, the specific heat at constant pressure appears, because heat diffusion is 
isobaric owing to csk >> Dk? in the limit of small wavenumbers with the velocity 
of sound cs and the thermal diffusion constant D;. The heat current flows in the 
direction of decreasing temperature, which implies 


K 
jg =-—VT A. 
Ja a (9.4.38c) 


with the thermal conductivity «. Overall, we thus obtain 


acy pe (9.4.38d) 
dt mecp 


a diffusion equation for the temperature. 
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Finally, we determine the equation of motion of the momentum density, 
ie. for a, 7 = 1,2,3. For the reversible terms (the first and second terms 


in Eq. (9.4.28)), we find by employing (9.4.18b-c) and (x! [vin ) = 0 the 
result 


wa! (k,w) — k; ((x7 |viX°) a?(k,w) + (x2 |vix*) a‘ (k,w)) 
= (= (or) = bjin(k,w) = ky" koT(k,4) (9.4.39) 


= (2 (ore) - ke) 


where, from P(x,t) = n(x,t)kT(x,t) = (n + dn(x,t))k(T + 6T(x,t)), it 
follows that 


6P =nkdT +kToén , 


which was used above. For the damping term in the equation of motion of 
the momentum density, we obtain from (9.4.28) using the approximation 
w,, = 1/7 the result: 


5 co 
ikiky S> SO (R?viX*) - (xe) a (k,w) 


N=1 u=6 i 


= ih > (oni ) (etree a) 


pU=6 (9.4.40) 


ss iskir(( eb non ) 
&(Hialo)(ele sere) eto 


In the second line, we have used the fact that the sum over \’ reduces to 
r =1,2,3. For the first term in the curved brackets we obtain: 
m 


se) ~ nkT 


kT 
ry (Outdtr + d510ir + 55rdi2) - 


vi|v1 dv f(v)ujviv, 


ae 


I 


For the second term in the curved brackets in (9.4.40), we need the results 
of problem 9.12, leading to 5ij Or As a result, the overall damping term 
(9.4.40) is given by 
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5 fore) 
ikiky S> SO (3 lvix*) — (x4 lu e’) a® (k, w) 


N=1 p=6 He 


RE 5 : 
= ikjkyr Cz t 5510%r t bjr Oil +5 a (k, w) 


(9.4.40’) 
=i (ihr) (-5) + kikjui(k,w) + kk (2) ) TkT ~ 
{1 n 
=i (5% (k- u(k,w)) + K?uy(k) rity) 
Defining the shear viscosity as 
n=ntkT , (9.4.41) 


we find with (9.4.39) and (9.4.40’) the following equivalent forms of the equa- 
tion of motion for the momentum density: 


wnu,(k,w) — ~ ky6P (ku) +it (5% (ku(k, w)) +I2uj(k.w)) =0, 


(9.4.42) 


or, in terms of space and time, 


S mnuy(x,)+V;P(,t)—n (Gv, (V - u(x,t)) + V7uj0x,t)) =0 (9.4.42) 


or 


o mnny (x,t) + Pyre (x,t) = 0 (9.4.42’") 


with the pressure tensor (Pjx,4 = ViPjr, etc.) 
2 
Pix (x,t) = 6;4P(x, t)—7 (unto + Ur,j (x,t) — 3 sat (%)) . (9.4.43) 


We can compare this result with the general pressure tensor of hydrody- 
namics: 


2 
Plot) = Sx Plot) — 0 (upalot) + 906) — F8ixtnel34)) — 


_ Cd5nt1,1(X, t) . (9.4.44) 


Here, ¢ is the bulk viscosity, also called the compressional viscosity. As a 
result of Eq. (9.4.44), the bulk viscosity vanishes according to the Boltzmann 
equation for simple monatomic gases. The expression (9.4.41) for the viscosity 
can also be written in the following form (see Eqns. (9.2.12) and (9.2.13)): 
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2 
mu 1 MUth 
n= tnkT = Tn —® = <nmupl = 


4.4 
3 3 30tot : (9 5) 


where yn = \/3kT/m is the thermal velocity from the Maxwell distribution; 
i.e. the viscosity is independent of the density. 
It is instructive to write the hydrodynamic equations in terms of the nor- 


malized functions i = Tay er’ etc. instead of the usual quantities n(x, t), 
n 


T (x,t), ui(x, t). From Eqns. (9.4.30), (9.4.37), and (9.4.42’) it follows that 


f(x, t) = —enViti'(x, t) (9.4.46a) 

T(x, t) = —erVi0'(x, t) + DV?F(x, t) (9.4.46b) 

éi(x,t) = —enVit — er Vit + “EV? ai + VV - a) (9.4.46c) 
mn 3mn 


with the coefficients cn = \/kT/m, cr = \/2kT/3m, D and from 
Eqns. (9.4.34) and (9.4.41). Note that with the orthonormalized quantities, 


the coupling of the degrees of freedom in the equations of motion is symmet- 
ric. 


9.4.5 Solutions of the Hydrodynamic Equations 


The periodic solutions of (9.4.46a-c), which can be found using the ansatz 
f(x, t) x ai(x,t) x T(x,t) « e(*-“), are particularly interesting. The 
acoustic resonances which follow from the resulting secular determinant and 
the thermal diffusion modes have the frequencies 


w = +c,k — 5 Dek? (9.4.47a) 
w = —iDrk? (9.4.47b) 


with the sound velocity c,, the acoustic attenuation constant D,, and the 
heat diffusion constant (thermal diffusivity) Dr 


—— 2; 2 = a — A.A 
‘ re 3m mms (hase) 
4 ie”, 29 
| pee aa ( ) (9.4.48b) 
38mn mn \Cy cp 
Deep aa, (9.4.48) 
Cp mMmcp 


In this case, the specific heat at constant pressure enters; for an ideal gas, it 
is given by 


5 
cp = gtk : (9.4.49) 
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The two transverse components of the momentum density undergo a purely 
diffusive shearing motion: 
2 
Dr = ihe : (9.4.50) 
mn 
The resonances (9.4.47a,b) express themselves for example in the density- 
density correlation function, S;,,(k,w). The calculation of dynamic suscep- 
tibilities and correlation functions (problem 9.11) starting from equations of 
motion with damping terms is described in QM II, Sect. 4.7. The coupled sys- 
tem of hydrodynamic equations of motion for the density, the temperature, 
and the longitudinal momentum density yields the density-density correlation 
function: 


On 
S ( w) (3), 
ol oh De (1 &) (we? = 2k?) Drk? | (1 - ) Drk? 
(w? — egk?)? + (wD5k?)? "w+ (Drk?)? 


(9.4.51) 


The density-density correlation function for fixed k is shown schematically 
as a function of w in Fig. 9.3. 


Fig. 9.3. The density- 
density correlation func- 
tion for fixed k as a func- 
tion of w 


The positions of the resonances are determined by the real parts and their 
widths by the imaginary parts of the frequencies (9.4.47a, b). In addition to 
the two resonances representing longitudinal acoustic phonons at +c;k, one 
finds a resonance at w = 0 related to heat diffusion. The area below the curve 
shown in Fig. 9.3, which determines the overall intensity in inelastic scattering 
experiments, is proportional to the isothermal compressibility (33) The 
relative strength of the diffusion compared to the two acoustic resonances 
is given by the ratio of the specific heats, ae This ratio is also called 
the Landau-Placzek ratio, and the diffusive resonance in Spn(k,w) is the 
Landau—Placzek peak. 
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Since the specific heat at constant pressure diverges as (T — T.)~7, while 
that at constant volume diverges only as (T — T.)~° (p. 256, p. 255), this 
ratio becomes increasingly large on approaching T,.. The expression (9.4.51), 
valid in the limit of small k (scattering in the forward direction), exhibits the 
phenomenon of critical opalescence, as a result of (On/OP)r « (T — Ty)77. 
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9.5.1 Relaxation-Time Approximation 


The general evaluation of the eigenvalues and eigenfunctions of the linear 
collision operator is complicated. On the other hand, since not all the eigen- 
functions contribute to a particular diffusion process and certainly the ones 
with the largest weight are those whose eigenvalues w are especially small, we 
can as an approximation attempt to characterize the collision term through 
only one characteristic frequency, 


(= +vv) f(x,v,t) = ~=(f(x,v,t) — f*(x,v,t)) . (9.5.1) 
This approximation is called the conserved relaxation time approximation, 
since the right-hand side represents the difference between the distribu- 
tion function and a local Maxwell distribution. This takes into account 
the fact that the collision term vanishes when the distribution function is 
equal to the local Maxwell distribution. The local quantities n(x, t), u’(x, t) 
and e(x,t) which occur in f‘(x,v,t) can be calculated from f(x,v,t) using 
Eqns. (9.3.15a), (9.3.15b), and (9.3.15c’). 
Our goal is now to calculate f or f — f*. We write 


6) ) 1 
—tvV¥ | (f—f)+(—+vv) ff =--(¢-f)):; (9.5.2) 
Ot Ot T 

In the hydrodynamic region, wr < 1, vkr < 1, we can neglect the first 
term on the left-hand side of (9.5.2) compared to the term on the right side, 
obtaining f — f’ = (2 + vv) f*. Therefore, the distribution function has 
the form 


fafi+r (= + vv) cia (9.5.3) 
and, using this result, one can again calculate the current densities, in an 
extension of Sect. 9.3.5.2. In zeroth order, we obtain the expressions found 
in (9.3.35) and (9.3.36) for the reversible parts of the pressure tensor and the 
remaining current densities. The second term gives additional contributions 
to the pressure tensor, and also yields a finite heat current. Since f’ depends 
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on x and ¢ only through the three functions n(x, t), T(x,t), and u(x,t), the 
second term depends on these and their derivatives. The time derivatives of 
f* or n, T, and u can be replaced by the zero-oder equations of motion. 
The corrections therefore are of the form Wn(x,t), WI(x, t), and Vuj(x, t). 
Along with the derivatives of P;; and q which already occur in the equations 
of motion, the additional terms in the equations are of the type TV°T(x, t) 
etc. (See problem 9.13). 


9.5.2 Calculation of W(v1, v2; vj, V9) 


The general results of the Boltzmann equation did not depend on the pre- 
cise form of the collision probability, but instead only the general relations 
(9.2.8a-f) were required. For completeness, we give the relation between 
W(v1,V2;vj,Vv5) and the scattering cross-section for two particles!®. It is 
assumed that the two colliding particles interact via a central potential 
w(xX1 — X2). We treat the scattering process 


/ / 
Vi,V2 > Vi,V0; 


in which particles 1 and 2, with velocities v; and v2 before the collision, are 
left with the velocities v{ and v4 following the collision (see Fig. 9.4). The 
conservation laws for momentum and energy apply; owing to the equality of 
the two masses, they are given by 


VitV2=Vi tv (9.5.4a) 


VWeveevi7+vi". (9.5.4b) 


V1 


Fig. 9.4. The collision of two par- 
V2 ticles 


18 The theory of scattering in classical mechanics is given for example in L. D. Lan- 
dau and E. M. Lifshitz, Course of Theoretical Physics, Vol. I: Mechanics, 3rd Ed. 
(Butterworth—Heinemann, London 1976), or H. Goldstein, Classical Mechanics, 
2nd Ed. (Addison—Wesley, New York 1980). 
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It is expedient to introduce the center-of-mass and relative velocities; before 
the collision, they are 


V=s(vitve), u=Vvi-ve, (9.5.5a) 
and after the collision, 
1 
V= 5 (V4 +v5), w=vi-—v. (9.5.5b) 


Expressed in terms of these velocities, the two conservation laws have the 
form 


V=vV (9.5.6a) 


jul = Ju]. (9.5.6b) 


In order to recognize the validity of (9.5.6b), one need only subtract the 
square of (9.5.4a) from two times Eq. (9.5.4b). The center-of-mass velocity 
does not change as a result of the collision, and the (asymptotic) relative 
velocity does not change its magnitude, but it is rotated in space. For the 
velocity transformations to the center-of-mass frame before and after the 
collision given in (9.5.5a) and (9.5.5b), the volume elements in velocity space 
obey the relations 


udu, = PV d?u = BV du! = dv). dbus (9.5.7) 


due to the fact that the Jacobians have unit value. 

The scattering cross-section can be most simply computed in the center- 
of-mass frame. As is known from classical mechanics, !® the relative coordinate 
x obeys an equation of motion in which the mass takes the form of a reduced 
mass p (here 2 = $m) and the potential enters as a central potential w(x). 
Hence, one obtains the scattering cross-section in the center-of-mass frame 
from the scattering of a fictitious particle of mass yz by the potential w(x). 
We first write down the velocities of the two particles in the center-of-mass 
frame before and after the collision 

= 2S. S ee IS hy ne er 9.5.8) 

Vis =Vi SpU Mie Sy Vig = BU an “Mag = su. (9.5. 

We now recall some concepts from scattering theory. The equivalent po- 
tential scattering problem is represented in Fig. 9.5, and we can use it to 
define the scattering cross-section. The orbital plane of the particle is deter- 
mined by the asymptotic incoming velocity u and position of the scattering 
center O. This follows from the conservation of angular momentum in the 
central potential. The z-axis of the coordinate system drawn in Fig. 9.5 passes 
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Fig. 9.5. Scattering by a 
fixed potential, with col- 
lision parameter s and 
scattering center O. The 
particles which impinge 
on the surface element 
sdsdy are deflected into 
the solid angle element dQ 


through the scattering center O and is taken to be parallel to u. The orbit 
of the incoming particle is determined by the collision parameter s and the 
angle y. In Fig. 9.5, the orbital plane which is defined by the angle ¢ lies 
in the plane of the page. We consider a uniform beam of particles arriving 
at various distances s from the axis with the asymptotic incoming velocity 
u. The intensity J of this beam is defined as the number of particles which 
impinge per second on one cm? of the perpendicular surface shown. Letting 
n be the number of particles per cm®, then J = nlu|. The particles which 
impinge upon the surface element defined by the collision parameters s and 
s+ ds and the differential element of angle dy are deflected into the solid- 
angle element d(. The number of particles arriving in d2 per unit time is 
denoted by dN(2). The differential scattering cross-section o(,u), which 
of course also depends upon u, is defined by dN(2) = Io(Q,u)dQ, or 


,dN(2) 


o(2,u) = 17 0 


(9.5.9) 
Owing to the cylinder symmetry of the beam around the z-axis, 0(.,u) = 
o(¥,u) is independent of y. The scattering cross-section in the center-of-mass 
system is obtained by making the replacement u = |v1 — vol. 

The collision parameter s uniquely determines the orbital curve, and 
therefore the scattering angle: 


dN(§2) = Isdy(—ds) . (9.5.10) 
From this it follows using dQ = sin vdddy that 


1 ds _ 1 1ds? 
snd do sind2 de” 


o(2,u) = (9.5.11) 


From V(s) or s(#), we obtain the scattering cross-section. The scattering 
angle 7? and the asymptotic angle y, are related by 


1 
V=7-—22q4 OF Pa = 5 (0 — 9) (9.5.12) 


(cf. Fig. 9.6). 
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Fig. 9.6. The scattering angle (deflection 
angle) ? and the asymptotic angle Ya 


In classical mechanics, the conservation laws for energy and angular mo- 
mentum give 


bs i os s 
va= | dr -| dr 
Tmin 724/2u(E — w(r)) — & r 2,/1 — 82 _ 2w(r) 


r2 r2 pou 
(9.5.13) 
here, we use 
l= psu (9.5.14a) 
to denote the angular momentum and 
E= eu (9.5.14b) 


for the energy, expressed in terms of the asymptotic velocity. The distance 
Tmin Of closest approach to the scattering center is determined from the con- 
dition (7 = 0): 
[2 
w(Tmin) + —z— = E. (9.5.14c) 
2uT min 

As an example, we consider the scattering of two hard spheres of radius 

R. In this case, we have 


nx wv v 
=? 3] G2 sj -- =? Brea 
8 Rsiny resin (5 4 feos 5 


from which, using (9.5.11), we find 
o(0,u) = R?. (9.5.15) 


In this case, the scattering cross-section is independent of the deflection angle 
and of u, which is otherwise not the case, as is known for example from 
Rutherford scattering’. 

After this excursion into classical mechanics, we are in a position to calcu- 
late the transition probability W(v, va; v3, v4) for the loss and gain processes 
in Eqns. (9.2.5) and (9.2.6). To calculate the loss rate, we recall the following 
assumptions: 


(i) The forces are assumed to be short-ranged, so that only particles within 
the same volume element d?a1 will scatter each other. 
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(ii) When particle 1 is scattered, it leaves the velocity element d?vy. 


To calculate the loss rate |, we pick out a molecule in d*z which has the 
velocity v; and take it to be the scattering center on which molecule 2 with 
velocity v2 in the velocity element d°vz impinges. The flux of such particles is 
f (x, V2, t)|v2—vi|d?v2. The number of particles which impinge on the surface 
element (—sds)dy per unit time is 


f(x, v2, t)|ve = vi|d?v2(—s ds)dy — 
= f(x, V2, t)|ve — vild?veo(Q, |vi — vo|)dQ2 . 


In order to obtain the number of collisions which the particles within 
d°xd*v, experience in the time interval dt, we have to multiply this result by 
f(x, vi, t)d?xd3vu,dt and then integrate over v2 and all deflection angles dQ: 


Id?xd?v,dt = [eee f ates,vi.t)F(,¥2,6)Iv2 —vi| x 
x a(2,|vi — ve|)d?ad? vi dt. (9.5.16) 


To calculate the gain rate g, we consider scattering processes in which a 
molecule of given velocity v{ is scattered into a state with velocity v1 by a 
collision with some other molecule: 


gd? ad*v,dt = fae f eupa sin, — vg] o(2, |v — vgl) x 
x f(x, vi, t)f(x,v,t)d°adt. (9.5.17) 


The limits of the velocity integrals are chosen so that the velocity v1 lies 
within the element d*v;. Using (9.5.7), we obtain for the right side of (9.5.17) 


Pv, [der faa —v9| o(Q, lvi — vol) f(x, vi, OS (x, vh, t)d?adt ; 
i.e. 
g= [der faa, —vo| (2, |vi — vol) f(x, vi, OF (%, vo, t) . (9.5.18) 


Here, we have also taken account of the fact that the scattering cross-section 
for the scattering of vi,v5 — vi, V2 is equal to that for v1, v2 > v4, v4, since 
the two events can be transformed into one another by a reflection in space 
and time. 

As a result, we find for the total collision term: 


ot) = g-l= f doy dQ\ve—vilo(2, Iva vil) (fifa) - (9.5.19) 
coll 
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The deflection angle J can be expressed as follows in terms of the asymptotic 
relative velocities!® 


(vi — v2)(vi — va) 


v = arccos ; aE 
lv1 — vallvi — v5 


The integral { dQ refers to an integration over the direction of u’. With the 
rearrangements 


2 2 2 
uw —w=vi —2vivad+ve — v2 + 2vive— ve 


Av"? 4 2vi? avi,” LAV? — 2v? —2v2 = 2(v/7 + vi? — v? — v3) 


and 


[eeiva—vil= fanu= fau' desu’ = wu! 
2 2 
asp alige uu 
= f atu ins( 5 4 
ul? ue 
= f au's 7 5 paves (v’—V) 


12 2 2 2 
=4 fenane (4 + Vo vio + V2 Jae (vies aap ee) 


2 2 


which also imply the conservation laws, we obtain 


72 , Produ} Bus W (v1,va:v),v5)(F fs — fife) - (9.5.20) 


In this expression, we use 


W(vr,vaivi. v4) = 4o(2, Iv — vil)é (= a *) : 
x 68) (vi tvh—vi—ve) . (9.5.21) 

Comparison with Eq. (9.2.8f) yields 
o(V1, V2; V4, Vv) = 4m4o(Q, |v — vil) - (9.5.22) 


From the loss term in (9.5.19), we can read off the total scattering rate 
for particles of velocity v1: 


= [ae fan Ivo — vi| a(Q, lvo — vil) f(x, vo,t). (9.5.23) 


= exn 
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The expression for tT! corresponds to the estimate in Eq. (9.2.12), which 
was derived by elementary considerations: T~! = nvynotot, with 


Otot = | ano(0. |vo — vil) = an | dss. (9.5.24) 
0 


Tmax is the distance from the scattering center for which the scattering angle 
goes to zero, i.e. for which no more scattering occurs. In the case of hard 
spheres, from Eq. (9.5.15) we have 


Otot = 40 R? . (9.5.25) 


For potentials with infinite range, rmax diverges. In this case, the collision 
term has the form 


O oo 20 
ar) fee ase) de(fi fs — fife)ivi — val - (9.5.26) 


Although the individual contributions to the collision term diverge, the over- 
all term remains finite: 


Tmax 


lim ds s(fi fs — fif2) = finite , 
Tmax J 

since for s — oo, the deflection angle tends to 0, and vi — vi — 0 and 

v5, — V2 — 0, so that 


(fifa — fife) +0. 
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Problems for Chapter 9 


9.1 Symmetry Relations. Demonstrate the validity of the identity (9.3.5) used to 
prove the H theorem: 


[eu feerfeufars W (v1, V2; v3, v4) (fife — fafa)yr 


— ; [ev ferfevs fev W (v1, V23;V3, va) 
x (fife — fsfs)(~1 + v2 — 3 — ys). (9.5.27) 


9.2 The Flow Term in the Boltzmann Equation. Carry out the intermediate steps 
which lead from the equation of continuity (9.2.11) for the single-particle distribu- 
tion function in z—space to Eq. (9.2.11’). 


9.3 The Relation between H and S. Calculate the quantity 
H(x,t) = [iv F(x.) 108 F(x, vt 


for the case that f(x,v,t) is the Maxwell distribution. 


9.4 Show that in the absence of an external force, the equation of continuity 
(9.3.28) can be brought into the form (9.3.32) 


n(Or t uj0;)e { O70 => Pj Oj; : 


9.5 The Local Maxwell Distribution. Confirm the statements made following 
Eq. (9.3.19’) by inserting the local Maxwell distribution (9.3.19’) into (9.3.15a)— 
(9.3.15c). 


9.6 The Distribution of Collision Times. Consider a spherical particle of radius 
r, which is passing with velocity v through a cloud of similar particles with a 
particle density n. The particles deflect each other only when they come into direct 
contact. Determine the probability distribution for the event in which the particle 
experiences its first collision after a time t. How long is the mean time between two 
collisions? 


9.7 Equilibrium Expectation Values. Confirm the results (G.1c) and (G.1g) for 


[ee (2) Pe and [ev vRvivjut fo(v) « 


9.8 Calculate the scalar products used in Sect. 9.4.2: (1|1), (€|1), (ele), (vilvs), 
(1x4), (x4 |vix?), (X°|vix?), (x*|v? 24), and (uj |vix*). 
9.9 Sound Damping. In (9.4.30), (9.4.37”) and (9.4.42”), the linearized hydro- 


dynamic equations for an ideal gas were derived. For real gases and liquids with 
general equations of state P = P(n,T), analogous equations hold: 


O 
apts) +nV-u(x,t) =0 


mn Sus (x,t) + 0; Pji(x, t) = 0 


Deh (Ge) Vm - BYTE. =e 


On 
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The pressure tensor P;;, with components 
2 
Pig = 0i3P — 9 (Vy us + Vig) + (31 Z c) dig VU 


now however contains an additional term on the diagonal, —CV-u. This term results 
from the fact that real gases have a nonvanishing bulk viscosity (or compressional 
viscosity) ¢ in addition to their shear viscosity 7. Determine and discuss the modes. 

Hint: Keep in mind that the equations partially decouple if one separates the 
velocity field into transverse and longitudinal components: u = uz+u; with V-uz = 
0 and V x u; = 0. (This can be carried out simply in Fourier space without loss of 
generality by taking the wavevector to lie along the z-direction.) 

In order to evaluate the dispersion equations (eigenfrequencies w(k)) for the 
Fourier transforms of n, uz, and T’, one can consider approximate solutions for w(k) 
of successively increasing order in the magnitude of the wavevector k. A useful 
abbreviation is 


n= (2),-(2),0-@,/@J-@,2 


Here, cs is the adiabatic velocity of sound. 


9.10 Show that 


( = [vix*) = bij (vilx'x*) = 65; aN (x! 


Van 


and verify (9.4.40’). 


9.11 Calculate the density-density correlation function Snn(k,w) = f Br f dt 
e 10“) (n(x, t)n(0,0)) and confirm the result in (9.4.51) by transforming to 
Fourier space and expressing the fluctuations at a given time in terms of ther- 
modynamic derivatives (see also QM I], Sect. 4.7). 


9.12 The Viscosity of a Dilute Gas. In Sect. 9.4, the solution of the linearized 
Boltzmann equation was treated by using an expansion in terms of the eigenfunc- 
tions of the collision operator. Complete the calculation of the dissipative part of 
the momentum current, Eq. (9.4.40). Show that 

: U5 xX aX Ur 5kT 
sn J |viX")(X° lor ) = 053 01r a 


1 n/m n/m 


9.13 Heat Conductivity Using the Relaxation-Time Approach. A further pos- 
sibility for the approximate determination of the dissipative contributions to the 
equations of motion for the conserved quantities particle number, momentum and 
energy is found in the relaxation-time approach introduced in Sect. 9.5.1: 


3) ee ia ay 
collision 


Ot T 


For g = f — f*, one obtains in lowest order from the Boltzmann equation (9.5.1) 
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g(x, v,t) = r(a { viV+ ZF: Vy) ae Ae 


Eliminate the time derivative of f* by employing the non-dissipative equations of 
motion obtained from f* and determine the heat conductivity by inserting f = f°+g 
into the expression for the heat current q derived in (9.3.29). 


9.14 The Relaxation-Time Approach for the Electrical Conductivity. Consider an 
infinite system of charged particles immersed in a positive background. The col- 
lision term describes collisions of the particles among themselves as well as with 
the (fixed) ions of the background. Therefore, the collision term no longer van- 
ishes for general local Maxwellian distributions f* (x,v,t). Before the application 
of a weak homogeneous electric field E, take f = f°, where f° is the position- 
and time-independent Maxwell distribution. Apply the relaxation-time approach 
Of /Ot|con = —(f — f°)/7 and determine the new equilibrium distribution f to first 
order in E£ after application of the field. What do you find for (v)? Generalize to a 
time-dependent field E(t) = Eo cos(wt). Discuss the effects of the relaxation-time 
approximation on the conservation laws (see e.g. John M. Ziman, Principles of the 
Theory of Solids, 2°" Ed. (Cambridge University Press, Cambridge 1972)). 


9.15 An example which is theoretically easy to treat but is unrealistic for atoms 
is the purely repulsive potential!® 


K 
vy—lrv-l 


w(r) = », vr22jn>O0. (9.5.28) 


Show that the corresponding scattering cross-section has the form 
2m \ v=T 
K\ a oe 
a(v, \vi — V2|) = (=) \vi = Vo| v=T Fy, (#) 5 (9.5.29) 


with functions F, (0) which depend on @ and the power v. For the special case of 
the so called Maxwell potential (v = 5), |vi — v2|o(v,|vi — va|) is independent of 
\vi = Vo. 


9.16 Find the special local Maxwell distributions 
v2 
f°(v,x,t) = exp (+B : v+ox.) 
2m 


which are solutions of the Boltzmann equation, by comparing the coefficients of the 
powers of v. The result is A = A; + Ao-x+C3x*, B = Bi — Act — (2C3t + C2)x4 
Qxx, C =C1 + Cot + C3t?. 


9.17 Let an external force F(x) = —VV(x) act in the Boltzmann equation. Show 
that the collision term and the flow term vanish for the case of the Maxwell distri- 
bution function 


jon glin)” [=p (MGR + v0). 


9.18 Verify Eq. (9.4.33b). 


19 Landau/Lifshitz, Mechanics, p. 51, op. cit. in footnote 18. 


10. Irreversibility and the Approach to 
Equilibrium 


10.1 Preliminary Remarks 


In this chapter, we will consider some basic aspects related to irreversible 
processes and their mathematical description, and to the derivation of macro- 
scopic equations of motion from microscopic dynamics: classically from the 
Newtonian equations, and quantum-mechanically from the Schrodinger equa- 
tion. These microscopic equations of motion are time-reversal invariant, and 
the question arises as to how it is possible that such equations can lead to 
expressions which do not exhibit time-reversal symmetry, such as the Boltz- 
mann equation or the heat diffusion equation. This apparent incompatibility, 
which historically was raised in particular by Loschmidt as an objection to 
the Boltzmann equation, is called the Loschmidt paradox. Since during his 
lifetime the reality of atoms was not experimentally verifiable, the apparent 
contradiction between the time-reversal invariant (time-reversal symmetric) 
mechanics of atoms and the irreversibility of non-equilibrium thermodynam- 
ics was used by the opponents of Boltzmann’s ideas as an argument against 
the very existence of atoms!. A second objection to the Boltzmann equation 
and to a purely mechanical foundation for thermodynamics came from the 
fact — which was proved with mathematical stringence by Poincaré — that 
every finite system, no matter how large, must regain its initial state periodi- 
cally after a so called recurrence time. This objection was named the Zermelo 
paradox, after its most vehement protagonist. Boltzmann was able to refute 
both of these objections. In his considerations, which were carried further by 
his student P. Ehrenfest?, probability arguments play an important role, as 
they do in all areas of statistical mechanics — a way of thinking that was how- 
ever foreign to the mechanistic worldview of physics at that time. We mention 
at this point that the entropy which is defined in Eq. (2.3.1) in terms of the 
density matrix does not change within a closed system. In this chapter, we 
will denote the entropy defined in this way as the Gibbs’ entropy. Boltzmann’s 


' See also the preface by H. Thirring in E. Broda, Ludwig Boltzmann, Deuticke, 
Wien, 1986. 

? See P. Ehrenfest and T. Ehrenfest, Begriffliche Grundlagen der statistischen Auf- 
fassung in der Mechanik, Encykl. Math. Wiss. 4 (32) (1911); English translation 
by M. J. Moravcsik: The Conceptual Foundations of the Statistical Approach in 
Mechanics, Cornell University Press, Ithaca, NY 1959. 
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concept of entropy, which dates from an earlier time, associates a particular 
value of the entropy not only to an ensemble but also to each microstate, as 
we shall show in more detail in Sect. 10.6.2. In equilibrium, Gibbs’ entropy 
is equal to Boltzmann’s entropy. To eliminate the recurrence-time objection, 
we will estimate the recurrence time on the basis of a simple model. Using 
a second simple model of the Brownian motion, we will investigate how its 
time behavior depends on the particle number and the different time scales 
of the constituents. This will lead us to a general derivation of macroscopic 
hydrodynamic equations with dissipation from time-reversal invariant micro- 
scopic equations of motion. Finally, we will consider the tendency of a dilute 
gas to approach equilibrium, and its behavior under time reversal. In this 
connection, the influence of external perturbations will also be taken into ac- 
count. In addition, this chapter contains an estimate of the size of statistical 
fluctuations and a derivation of Pauli’s master equations. 

In this chapter, we treat a few significant aspects of this extensive area 
of study. On the one hand, we will examine some simple models, and on the 
other, we will present qualitative considerations which will shed light on the 
subject from various sides. 

In order to illuminate the problem arising from the Loschmidt paradox, 
we show the time development of a gas in Fig. 10.1. The reader may con- 
jecture that the time sequence is a,b,c, in which the gas expands to fill the 
total available volume. If on the other hand a motion reversal is carried out 
at configuration c, then the atoms will move back via stage b into configu- 
ration a, which has a lower entropy. Two questions arise from this situation: 
(i) Why is the latter sequence (c,b,a) in fact never observed? (ii) How are 
we to understand the derivation of the H theorem, according to which the 
entropy always increases? 


Fig. 10.1. Expansion or contraction of a gas: total volume V, subvolume Vi (cube 
in lower-left corner) 
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10.2 Recurrence Time 


Zermelo (1896)? based his criticism of the Boltzmann equation on Poincaré’s 
recurrence-time theorem’. It states that a closed, finite, conservative system 
will return arbitrarily closely to its initial configuration within a finite time, 
the Poincaré recurrence time tp. According to Zermelo’s paradox, H(t) could 
not decrease monotonically, but instead must finally again increase and regain 
the value H(0). 

To adjudge this objection, we will estimate the recurrence time with the 
aid of a model’. We consider a system of classical harmonic oscillators (linear 
chain) with displacements g,, momenta p, and the Hamiltonian (see QM II, 
Sect. 12.1): 


N 
1 @ 
=e { pr + S (dn in-1)"} (10.2.1) 


From this, the equations of motion are obtained: 
Bn = Mn = M2 (Gn41 + dn—1 — 2Gn) - (10.2.2) 


Assuming periodic boundary conditions, qq = qn, we are dealing with a 
translationally invariant problem, which is diagonalized by the Fourier trans- 
formation 


: / i 


Q, and (P,) are called the normal coordinates (and momenta). The periodic 
boundary conditions require that 1 = e'®, i.e. s = 2a with integral /. The 
values of s for which / differs by N are equivalent. A possible choice of values 
of 1, e.g. for odd N, would be: 1 = 0,+1,...,4(N — 1)/2. Since qn and pp, 
are real, it follows that 


Q* =Q_. and P*=P_,. 


The Fourier coefficients obey the orthogonality relations 


Tl N 
a S- else isin — A(s = s') = 


n=1 


i for s — s’ = 27h with h integral 


0 otherwise 


(10.2.4) 


3 BE. Zermelo, Wied. Ann. 57, 485 (1896); ibid. 59, 793 (1896). 
4H. Poincaré, Acta Math. 13, 1 (1890) 
° P. C. Hemmer, L. C. Maximon, and H. Wergeland, Phys. Rev. 111, 689 (1958). 
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and the completeness relation 
~ Spee Soa: (10.2.5) 
8 
Insertion of the transformation to normal coordinates yields 
H = aS (P,P* + w2QsQ*) (10.2.6) 


Ss 


with the dispersion relation 
Ww, = 22 |sin 5 (10.2.7) 


We thus find N non-coupled oscillators with eigenfrequencies® w,. The motion 
of the normal coordinates can be represented most intuitively by introducing 
complex vectors 


Zs = Ps, +iwsQs, (10.2.8) 
which move on a unit circle according to 
Z, = a,e** (10.2.9) 


with a complex amplitude a, (Fig. 10.2). 


Ap Z2 
Zz ap 


Fig. 10.2. The motion of 
1 2 3 the normal coordinates 


We assume that the frequencies w, of N — 1 such normal coordinates are 
incommensurate, i.e. their ratios are not rational numbers. Then the phase 
vectors Z, rotate independently of one another, without coincidences. We 
now wish to calculate how much time passes until all N vectors again come 
into their initial positions, or more precisely, until all the vectors lie within 
an interval Ay around their initial positions. The probability that the vector 
Zs lies within Ay during one rotation is given by Ay/27, and the proba- 
bility that all the vectors lie within their respective prescribed intervals is 
(Ay/2r)‘~*. The number of rotations required for this recurrence is there- 
fore (27/ Avy, The recurrence time is found by multiplying by the typical 


® The normal coordinate with s = 0, ws = 0 corresponds to a translation and need 
not be considered in the following. 
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rotational period” +: 


N-1 
20 1 
~ (20 oe, 10.2.10 
a) Ww 
Taking Ay = =, N = 10 and w = 10 Hz, we obtain tp © 10!” years, 


i.e. more than the age of the Universe. These times of course become much 
longer if we consider a macroscopic system with N ~ 107°. The recurrence 
thus exists theoretically, but in practice it plays no role. We have thereby 
eliminated Zermelo’s paradox. 


Remark: We consider further the time dependence of the solution for the coupled 
oscillators. From (10.2.3) and (10.2.9) we obtain 


gn(t) = > “— (2.(0) COS West + ott) sin wst) ; (10.2.11) 


from which the following solution of the general initial-value problem is found: 


gn(t) = FL (w(0) cos(s(n—n’) wet) + #O sin(s(n—n’) wet) ) . (10.2.12) 


s 
syn! 


As an example, we consider the particular initial condition g,/ = dn/,0, Gn’ (0) = 0, 
for which only the oscillator at the site 0 is displaced initially, leading to 


gn(t) = ny S~cos(sn — 22 | sin 51) (10.2.13) 


As long as N is finite, the solution is quasiperiodic. On the other hand, in the limit 
N—-oc 


a 
On 


i ds cos(sn — 20t | sin 5) = i, ds cos(s2n — 2t sins) 


—T 


Qn(t) 


= Jan(2Nt) ~ 4/ — cos(2Qt — mn — ) for longt. (10.2.14) 


Jp, are Bessel functions’. The excitation does not decay exponentially, but instead 
algebraically as ee 

We add a few more remarks concerning the properties of the solution 
(10.2.13) for finite N. If the zeroth atom in the chain is released at the time t = 0, 
it swings back and its neighbors begin to move upwards. The excitation propagates 
along the chain at the velocity of sound, a2; the n-th atom, at a distance d = na 
from the origin, reacts after a time of about t ~ %. Here, a is the lattice constant. 


” A more precise formula by P.C. Hemmer, L.C. Maximon, and H. Wergeland, 


20 
N-1 
: Me Ags 1 2-N 
op. cit. 5, yields Tp = «x —Ay*™. 
wae Ws N 
s=1 Ay 


8 1.8. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Products, Aca- 
demic Press, New York, 1980, 8.4.11 and 8.4.51 
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The displacement amplitude remains largest for the zeroth atom. In a finite chain, 
there would be echo effects. For periodic boundary conditions, the radiated oscil- 
lations come back again to the zeroth atom. The limit N — co prevents Poincaré 
recurrence. The displacement energy of the zeroth atom initially present is divided 
up among the infinitely many degrees of freedom. The decrease of the oscillation 
amplitude of the initially excited atom is due to energy transfer to its neighbors. 


10.3 The Origin of Irreversible Macroscopic Equations 
of Motion 


In this section, we investigate a microscopic model of Brownian motion. We 
will find the appearance of irreversibility in the limit of infinitely many de- 
grees of freedom. The derivation of hydrodynamic equations of motion in 
analogy to the Brownian motion will be sketched at the end of this section 
and is given in more detail in Appendix H.. 


10.3.1 A Microscopic Model for Brownian Motion 


As a microscopic model for Brownian motion, we consider a harmonic os- 
cillator which is coupled to a harmonic lattice®. Since the overall system is 
harmonic, the Hamiltonian function or the Hamiltonian operator as well as 
the equations of motion and their solutions have the same form classically and 
quantum mechanically. We start with the quantum-mechanical formulation. 
In contrast to the Langevin equation of Sect. 8.1, where a stochastic force 
was assumed to act on the Brownian particle, we now take explicit account 
of the many colliding particles of the lattice in the Hamiltonian operator and 
in the equations of motion. The Hamiltonian of this system is given by 


H=HotHert+H_ , 

1 MQ? 1 1 

22 pe ——_(Q? Wea 2 i Pun! nYn’ 

om? +9 @ > Mr am 2uPat 3 Dy a seaeme Clee) 


nn’ 
Hr = ye CnIn@ , 


n 


Ho 


where Ho is the Hamiltonian of the oscillator of mass M and frequency 92. 
Furthermore, Hr is the Hamiltonian of the lattice! with masses m, mo- 
menta py, and displacements qn from the equilibrium positions, where we 
take m < M. The harmonic interaction coefficients of the lattice atoms are 
Pyn’. The interaction of the oscillator with the lattice atoms is given by 7/7; 


° The coupling to a bath of oscillators as a mechanism for damping has been in- 
vestigated frequently, e.g. by F. Schwabl and W. Thirring, Ergeb. exakt. Natur- 
wiss. 36, 219 (1964); A. Lopez, Z. Phys. 192, 63 (1965); P. Ullersma, Physica 
32, 27 (1966). 

10 We use the index F, since in the limit N — oo the lattice becomes a field. 
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the coefficients cy characterize the strength and the range of the interactions 
of the oscillator which is located at the origin of the coordinate system. The 
vector n enumerates the atoms of the lattice. The equations of motion which 
follow from (10.3.1) are given by 


MQ = -M’Q- > cndn 
and 


Man eS S° Pan’ Gn! = CnQ . (10.3.2) 


We take periodic boundary conditions, gd, = dn+n;, with Ni = (M,0,0), 
Ne = (0, No,0), and Ns = (0,0,.N3), where N; is the number of atoms in 
the direction é;. Due to the translational invariance of Hr, we introduce the 
following transformations to normal coordinates and momenta: 


1 ‘ka m _ika 
a e = 5 n— oa e : 2P, . 10.3.3 
tn = ae » Qk, Pa =z » i (10.3.3) 


The inverse transformation is given by 


—ika. 1 ika 
Le "G@, , PB =—<— "Dn. 10.3.4 
Qx ywoe q c Tran 2° D (10.3.4) 


The Fourier coefficients obey orthogonality and completeness relations 


1 ‘tenik! 1 
N = eikrk )-an = A(k = k’) ’ N a el (an—an’) = Onn! (10.3.5a,b) 
n k 


1 fork= 
with the generalized Kronecker delta A(k) = e le : 
0 otherwise 


From the periodic boundary conditions we find the following values for the 

wavevector: 

T2 13 
+ 

Ny. Ng 


k= gi— +82 with r; =0,+1,+2,... . 


TL 
Ni 
Here, we have introduced the reciprocal lattice vectors which are familiar 
from solid-state physics: 


2 2 2 
gi= (=, 0,0) » S2= (0,—*,0) » 3 = (0,0,*) 7 


The transformation to normal coordinates (10.3.3) converts the Hamiltonian 
for the lattice into the Hamiltonian for N decoupled oscillators, viz. 


1 


Hr =5 > (PAP + w2QhQx) ; (10.3.6) 
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with the frequencies!! (see Fig. 10.3) 


(10.3.7) 


Fig. 10.3. The frequencies wx 
kz along one of the coordinate axes, 


Wmax = Wr/a 


From the invariance of the lattice with respect to infinitesimal trans- 
lations, we obtain the condition 5°, @(n,n’) = 0, and from translational 
invariance with respect to lattice vectors t, it follows that (n+ t,n’+t) = 
@(n,n’) = &(n — n’). The latter relation was already used in (10.3.7). From 
the first of the two relations, we find limy_.9 wz = 0, ie. the oscillations of the 
lattice are acoustic phonons. Expressed in terms of the normal coordinates, 
the equations of motion are (10.3.2) 


4c 1 ‘ 
MQ =-M22’°Q- aan » c(k)* Qx (10.3.8a) 
mQx = —mwzZQx — (= c(k) Q (10.3.8b) 
with 


e(k) = cae * (10.3.9) 


For the further treatment of the equations of motion (10.3.8a,b) and the 
solution of the initial-value problem, we introduce the half-range Fourier 
transform (Laplace transform) of Q(t): 


O@y= e dt Q(t) = ‘< dt &'O(t)Q(t) . (10.3.10a) 


‘1 We assume that the harmonic potential for the heavy oscillator is based on the 
same microscopic interaction as that for the lattice atoms, @(n, n’). If we denote 
its strength by g, then we find 2 = Je and Wmaxr = J, and therefore 
Q K Wmaz. The order of magnitude of the velocity of sound is c = aWmaz. 
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The inverse of this equation is given by 
a(a(t) = [du e"Q(w). (10.3.10b) 


For free oscillatory motions, (10.3.10a) contains 64 distributions. For their 
convenient treatment, it is expedient to consider 


Q(w + in) = ye dt hott Q(z) , (10.3.11a) 


with 7 > 0. If (10.3.10a) exists, then with certainty so does (10.3.1la) owing 
to the factor e~"’. The inverse of (10.3.11a) is given by 


é"O0)= i» due! Q(w + in) , ie. 


Q(t)O(t) =] dw ei tint Q(w + in) . (10.3.11b) 


For the complex frequency appearing in (10.3.11a,b) we introduce z = w+in. 
The integral (10.3.11b) implies an integration path in the complex z-plane 
which lies in above the real axis 
oo+in ; e 
Q(t)O(t) = i dze"** Q(z) . (10.3.11b’) 


—oot+in 


The half-range Fourier transformation of the equation of motion (10.3.8a) 
yields for the first term 


| dt (0) = el Q(t) | — ic f dt e'**Q(t) 
= —Q(0) +izQ(0) — 2?Q(z). 


All together, for the half-range Fourier transform of the equations of mo- 
tion (10.3.8a,b) we obtain 


M (22 + 9) Q(z) = - = Yelk)" Gx(2) + M(Q(0) - iz Q(0)) 
VmN 
: (10.3.12) 


(10.3.13) 


The elimination of Qx(2) and replacement of the initial values Q;(0), Qx(0) 
by dn(0), dn(0) yields 
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D(z) Q(z) = M (@(0) - iz Q(0)) 
—ikan 


ys sey me Fae) (4n(0) —izgn(0)) (10.3.14) 
n ok 


D(z) = (a1 (rr) 4 ~ S- —* ) (10.3.15) 


= 2? — wi?) 


Now we restrict ourselves to the classical case, and insert the particular initial 
values for the lattice atoms gn(0) = 0, gn(0) = 0 for all the n'?, then we find 


62) = M (Q(0) = izQ@) | 
— M2? + MO? —Y), IIRE /(—2? + wh) 


(10.3.16) 


From this, in the time representation, we obtain 
dw —iztf : —iwyt 
A(HQ(t) = | — e Q(z) =-i Sg) (10.3.17) 


where w, are the poles of Q(z) and g(w,) are the residues!*. The solution 
is thus quasiperiodic. One could use this to estimate the Poincaré time in 
analogy to the previous section. 

In the limit of a large particle number N, the sums over k can be replaced 
by integrals and a different analytic behavior may result:'4 


3 2 
ie Is (10.3.18) 


Pao ae ieee 


3 
D(z) = —M2?+MaQ?4" i; 
m J ( 


The integral over k spans the first Brillouin zone: —-* < kj < *. For a simple 


evaluation of the integral over k, we replace the region of integration by a 
sphere of the same volume having a radius A = (ay? . 2n and substitute 
12 In the quantum-mechanical treatment, we would have to use the expectation 
value of (10.3.14) instead and insert (qn(0)) = (qn(0)) = 0. In problem 10.6, the 
force on the oscillator due to the lattice particles is investigated when the latter 
are in thermal equilibrium. 

The poles of Q(z), z =w+in are real, i.e. they lie in the complex w-plane 
below the real axis. (10.3.17) follows with the residue theorem by closure of the 
integration in the lower half-plane. 

In order to determine what the ratio of t and N must be to permit the use of 
the limit N — oo even for finite N, the N—dependence of the poles wy must be 
found from D(z) = 0. The distance between the poles wy is Aw, ~ x, and the 
values of ns residues are of O (x ). The frequencies w, obey wy+1 — ~ mae 
Fort < ~, the phase factors e'“”* vary only weakly as a ace of v, and 


the sum over v in (10.3.17) can be replaced by an integral. 


13 


14 
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the dispersion relation by w~ = c|k| where c is the velocity of sound. It then 
follows that 


3 Ac 2 3 Ac 
a 1 dv v a 1 
| 5——s lev)? = —a5 - dv|c(v)|?-+ 
0 =a 0 


m 2723 z m 27283 
© dy |e(v)/? © dy |c(v)|? 
+ 2 | oo ei = 2 f pore a (10.3.19) 


with v = c\k|. We now discuss the last equation term by term making use of 
the simplification |c(v)|? = g? corresponding to Cn = g6n,0- 
1st term of (10.3.19): 
| 
m 27203 


Ac 
i dv|c(v)|? = —g*Ac. (10.3.20) 
0 


This yields a renormalization of the oscillator frequency 


ae 
ee |@ g Ac ie (10.3.21) 


2nd term of (10.3.19) and evaluation using the theorem of residues: 


ar 92 f° aw : 
wie gz | =) = oe =—-MTiz (10.3.22) 
2,3 
ga 1 m 
=f a= “4a: (10.3.23) 


The third term of (10.3.19) is due to the high frequencies and affects the 
behavior at very short times. This effect is treated in problem 10.5, where 


a continuous cutoff function is employed. If we neglect it, we obtain from 
(10.3.16) 


(—2? + @? —iz) Q(z) = M(Q(0) —izQ(0)) , (10.3.24) 


and, after transformation into the time domain for t > 0, we have the follow- 
ing equation of motion for Q(t): 


The coupling to the bath of oscillators leads to a frictional term and to 
irreversible damped motion. For example, let the initial values be Q(0) = 0, 


Q(t = 0) = Q(0) (for the lattice oscillators, we have already set qn(0) = 
gn(0) = 0); then from Eq. (10.3.24) it follows that 


° dw _ e#*Q(0) 
Qn —2z24+ 0% —if'z 


aia) = | (10.3.26) 


—co 
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and, using the theorem of residues, 


elt /2 sin Wot - 


Q(t) = Q(0) , (10.3.27) 


WO 


with wo = \/w* — =. 
The conditions for the derivation of the irreversible equation of motion 
(10.3.25) were: 


a) A limitation to times t < a, This implies practically no limitation 
for large N, since the exponential decay is much more rapid. 
b) The separation into macroscopic variables = massive oscillator (of mass 
M) and microscopic variables = lattice oscillators (of mass m) leads, 
owing to 77 < 1, to a separation of time scales 


<< Wmax 5 r< Wmax - 


The time scales of the macroscopic variables are 1/2, 1/I. 

The irreversibility (exponential damping) arises in going to the limit N > 
oo. In order to obtain irreversibility even at arbitrarily long times, the limit 
N — o must first be taken. 


10.3.2 Microscopic Time-Reversible and Macroscopic Irreversible 
Equations of Motion, Hydrodynamics 


The derivation of hydrodynamic equations of motion (Appendix H.) directly 
from the microscopic equations is based on the following elements: 


(i) The point of departure is represented by the equations of motion for 
the conserved quantities and the equations of motion for the infinitely many 
nonconserved quantities. 

(ii) An important precondition is the separation of time scales ck X wWy.c., 
i.e. the characteristic frequencies of the conserved quantities ck are much 
slower than the typical frequencies of the nonconserved quantities wy.¢., anal- 
ogous to the w) (A > 5) in the Boltzmann equation, Sect. 9.4.4. This permits 
the elimination of the rapid variables. 


In the analytic treatment in Appendix H., one starts from the equations of 
motion for the so called Kubo relaxation function ¢ and obtains equations of 
motion for the relaxation functions of the conserved quantities. From the one- 
to-one correspondence of equations of motion for ¢ and the time-dependent 
expectation values of operators under the influence of a perturbation, the 
hydrodynamic equations for the conserved quantities are obtained. The re- 
maining variables express themselves in the form of damping terms, which 
can be expressed by Kubo formulas. 


'® These times, albeit long, are much shorter than the Poincaré recurrence time. 
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*10.4 The Master Equation and Irreversibility 
in Quantum Mechanics"® 


We consider an isolated system and its density matrix at the time t, with 
probabilities w;(t) 


o(t) = ys wy(t) |t) (| . (10.4.1) 


The states |i) are eigenstates of the Hamiltonian Ho. We let the quantum 
numbers 7 represent the energy FE; and a series of additional quantum num- 
bers v;. A perturbation V also acts on the system or within it and causes 
transitions between the states; thus the overall Hamiltonian is 


H=Ho+V. (10.4.2) 


For example, in a nearly ideal gas, 7g could be the kinetic energy and V 
the interaction which results from collisions of the atoms. We next consider 
the time development of g on the basis of (10.4.1) and denote the time- 
development operator by U(r). After the time 7 the density matrix has the 
form 


= 2 d wilt) |9) Gi] O) |e) UN) |B) (A (10.4.3) 


where the matrix elements 
Uji(7) = (i U(r) |t) (10.4.4) 


have been introduced. We assume that the system, even though it is prac- 
tically isolated, is in fact subject to a phase averaging at each instant as a 
result of weak contacts to other macroscopic systems. This corresponds to 
taking the trace over other, unobserved degrees of freedom which are coupled 
to the system!”. Then the density matrix (10.4.3) is transformed to 


16 W. Pauli, Sommerfeld Festschrift, S. Hirzel, Leipzig, 1928, p. 30. 

'’ If for example every state |j) of the system is connected with a state |2, 7) of 
these other macroscopic degrees of freedom, so that the contributions to the total 
density matrix are of the form |2, 7) |j) (k| (2,4| , then taking the trace over 2 
leads to the diagonal form |j) (j|. This stochastic nature, which is introduced 
through contact to the system’s surroundings, is the decisive and subtle step in 
the derivation of the master equation. Cf. N.G. van Kampen, Physica 20, 603 
(1954), and Fortschritte der Physik 4, 405 (1956). 
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2 ew 19) (| Uje(7 UF (7) . (10.4.5) 


Comparison with (10.4.1) shows that the probability for the state |7) at the 
time t+ 7 is thus 


(t+ 7) aos wi (t)|U ji ( ; 
and the change in the probability is 


w;(t+7) — w;(t) = Yo (wilt) — wy (8) Oj(7)/? , (10.4.6) 


4 


where we have used )>; |U;;(7)|? = 1. On the right-hand side, the term i = j 
vanishes. We thus require only the nondiagonal elements of U;;(7), for which 
we can use the Golden Rule’® 


1 /sinw;; 7/2 ‘ . . 
Wnt = 5s (SP) laivinr 


ees —-- . ae! 
a5 = 125(Ei — By) IV) | 


nh 
(10.4.7) 


with wi; = (E; — E;)/h. The limit of validity of the Golden Rule is AE > 
oEt > de, where AF is the width of the energy distribution of the states and 
dé is the spacing of the energy levels. From (10.4.6) and (10.4.7), it follows 
that 
dw; (t) 20 : 
= (wilt) — 5 (t)) 6B: — B;)| GV |) ? 


a 


As already mentioned at the beginning of this section, the index i = (E;,1%) 
includes the quantum numbers of the energy and the 1, the large number of 
all remaining quantum numbers. The sum over the energy eigenvalues on the 
right-hand side can be replaced by an integral with the density of states o(E£;) 
according to 


Sov f ab eB) 


so that, making use of the 6-function, we obtain: 
dwe;v; (t) 27 
OE = wo. — WE; OCB] (Ej, 4 V LB}. P - (10.4.8) 


yi 


8 QM I, Eq. (16.36) 
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With the coefficients 


QT 
AB, Vivi = 5 (EI (E;,uj|V|E;,%) |? , (10.4.9) 


Pauli’s master equation follows: 


wes a 
deveus(E) = Dos vii (WE; 4 (t) — WE; v,(t)) - (10.4.10) 


This equation has the general structure 


where the transition rates Wy» = Wnn obey the so called detailed balance 
condition!® 

Waid, = Wand, (10.4.12) 
for the microcanonical ensemble, p;4 = p%4 for all n and n’. One can show in 


general that Eq. (10.4.11) is irreversible and that the entropy 


= — pn log pn (10.4.13) 


increases. With (10.4.11), we have 


5 el Ss log pn)’ Wr n(Dn/ = Dn) 


n,n! 


= ye Wri n Pn’ ((Pn log pn)’ = (Pni log pn’)’) . 


n,n! 


By permutation of the summation indices n and n’ and using the symmetry 
relation Wyn = Wr n, we obtain 


1 
= 3 S- Warn’ (Pni — Pn) (Pn log pnv)’ = (Pn log pn)’ ) >0, (10.4.14) 


n,n! 


where the inequality follows from the convexity of xlogax (Fig. 10.4). The 
entropy continues to increase until py = py’ for all n and n’. Here we assume 
that all the n and n’ are connected via a chain of matrix elements. The 
isolated system described by the master equation (10.4.10) approaches the 
microcanonical equilibrium. 


' See QM II, following Eq. (4.2.17). 
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ey zlogz 


Fig. 10.4. The function 
f(x) = xlogz is convex, 


(2'—a)(f"(@"')—f'(a)) > 0 


10.5 Probability and Phase-Space Volume 


*10.5.1 Probabilities and the Time Interval of Large Fluctuations 


In the framework of equilibrium statistical mechanics one can calculate the 
probability that the system spontaneously takes on a constraint. In the con- 
text of the Gay-Lussac experiment, we found the probability that a system 
with a fixed particle number N with a total volume V would be found only 
within the subvolume V, (Eq. (3.5.5)): 


W(E,VY1) =e SEY)-SEM))/E (10.5.1) 


For an ideal gas”°, the entropy is S(E,V) = kN(logxz4¢ + 3). Since 
ae 


B= 3N kT, Av remains unchanged on expansion and it follows that 


W(E,Vi) = e NS. This gives log = log ess ~ aa for low 


Vv 
Vi 
larger. The dependence on N ~ 107° is dominant in Eq. (10.5.1) for macro- 
scopic systems. For general constraints, from Eq. (3.5.5) we obtain 


compressions. At higher compressions, Vi << V, the factor log = becomes 


W(E, Z) =e SE)-SBZ))/k (10.5.2) 


As an example, we consider in more detail the density fluctuations in an 
ideal gas. The N gas atoms are distributed between the subvolume V; and 
the subvolume V — V, with the probabilities p = 4 and 1 — p corresponding 
to a binomial distribution?!. The mean square deviation of the particle num- 
ber (An)? within the subvolume Vj is related to the average of the particle 
number in this volume, n = pN: 


20 Chapter 2 
71 Sect. 1.5.1 
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An)? 1 A 1 
ae se ee, (10.5.3) 
n n n Jn 
For 7 = 10?°, this ratio is (An) = 10-'°. For large N, n and N — n, from 


the binomial distribution we obtain the probability density for the particle 
number n by expansion and using the Stirling approximation; it is a Gaussian 
distribution 
1 (nan)? 
w(n) = e 2, 10.5.4 
(0) = = (10.5.4) 


This last formula is also a result of the central limit theorem??. The proba- 
bility density is normalized to 1: [ dnw(n) = 1. 
We are now interested in the case that n is greater than the value n+ dn, 


i.e. in the occurrence of fluctuations which are greater than a given dn. Its 
probability is obtained by integrating (10.5.4) over the interval [mi + dn, co] 


1 oS 2 1 eS 
dv e 2% dze , 


V2nin on Vn bn/V2h% 
2 ; (1 -0()) (10.5.5) 


with the error integral 


ws (dn) = 


G(x) = =f dye’ , (10.5.6) 


where already for values x = 1 the approximation formula $(1 —@(x)) & 
o-* 


dJae May be employed. 


In Fig. 10.5, we show a possible time dependence of the particle number 
n(t). Knowing the probability for the occurrence of a fluctuation, we can also 
make predictions about the typical time interval between fluctuations. First 
of all, the probability ws(dn) has the following meaning in terms of the time: 
denoting by ts, the overall time during which n(t) remains above ni + dn 
during the time period to of observation, then we find 


ie. (10.5.7) 
to 

We are interested in the average time required until a density increase which 
is greater than dn occurs, and term it the waiting time Js5,. Here, we must 
also introduce the time 7) which is required for a fluctuation, once it occurs, 
to again degrade. Typically, 79 = L/c, where L is the linear dimension of the 
system and cis the velocity of sound, e.g. L = 1cm, ¢ = 10° cm/s, 7 = 107°s. 
The time introduced above, t5,, within which the deviation exceeds én, is 


22 Sect. 1.2.2 


496 10. Irreversibility and the Approach to Equilibrium 


n(t) 
bn 
n 
Fig. 10.5. <A_ possible 
t time dependence of the 
particle number’ within 
0 to the subvolume V1 


tsn = T) X (the number of fluctuations above dn). For relatively large én, 
to — tsn © to and thus the waiting time V5, is equal to the ratio of to to the 
number of fluctuations larger than 6n 


WO ps a My ofl a 
ce ta” ~ -w5(dn) =n/5 (1 o( a 5)) , (10.5.8) 


where (10.5.5) was used. 
As an illustration, the waiting times until a density fluctuation upwards 


of more than én occurs are collected in Table 10.1 for several values of the 


relative deviation ai and 2 = on % for n = 107°. Deviations smaller than 


3 x 10~!° regularly occur; those larger than 7 x 10~!° never occur! And even 
these in fact small deviations (7 x 10~'°) are macroscopically unobservable. 
This small interval encompasses the whole range from frequent to never. 
Now we return to the fluctuation curve (Fig. 10.5). We consider a rela- 
tively large dn. To the right of the maximum, there is a monotonic decrease. 
There are just as many intersection points to the left of the maximum as to 
its right. For large én, the intersection point must in fact lie on the maximum. 
But to the left of the maximum, n(t) must have increased; this is so improb- 
able that it practically never occurs. In practice, such an improbable state is 
never generated by a spontaneous fluctuation, but instead is determined as 
the initial state in an experiment by the lifting of constraints. Starting from 


Table 10.1. Waiting Times 


z 3(1—@(2x)) Relative deviation “ Waiting time Jsn = oo 
n gU-P(x 

3  1x107° 3x 1071° 1s 

5 8xio 5 x 10710 1.3 x 10° s =5 months 


7 2x 1073 7x 10719 5x 10'7 s=2 x 10’° years 
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such an improbable configuration, the system always relaxes towards n and 
never assumes an even more improbable state, e.g. an even greater density 
on expansion of a gas. Small fluctuations are superposed onto this relaxation. 


10.5.2 The Ergodic Theorem 


The ergodic theorem played an important role in classical physics, providing 
the justification for the statistical description of matter. It was proposed 
by Boltzmann, and states that an arbitrary trajectory passes through every 
point in phase space and that therefore an average over time is identical with 
an average over phase space. In this form, the theorem is not tenable; rather, 
its must be modified as follows (it is then referred to as the quasi-ergodic 
theorem): every trajectory, apart from a set of null measure, approaches each 
point in phase space arbitrarily closely. This also yields the result that a 
time average is equal to an ensemble average. If a system is ergodic?®, i.e. the 
quasi-ergodic theorem applies to it, then the following expression holds for 
functions in phase space: 


ae 1 
lim — dt t t))==>=>——_— Dr : 10.5. 
jim 7 fat Halt.) = aap fat tan): (10.5.9) 
“time average is equal to ensemble average”. 
Choosing f(q,p) = O(¢,p € G), where G is a region of the energy shell, 
we obtain from (10.5.9) in the limit of long T 


ta _ |Iel 
oR? (10.5.10) 


where 7g is the time during which the trajectory remains within G and |g] is 
the phase-space volume of the region G. It is thus the fraction of time which 
the system spends in G, equal to the ratio of the volumes in phase space. 
The time spent by a system within an improbable region (in an atypical 
configuration) is short. 


23 T,, Boltzmann, as well as P. and T. Ehrenfest, associated something more like 
the idea of what we would now call “mixing” with the concept “ergodic”, which 
they also circumscribed as “stirred apart” (Ger. ‘zerriihren’). 
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10.6 The Gibbs and the Boltzmann Entropies and their 
Time Dependences 


10.6.1 The Time Derivative of Gibbs’ Entropy 


The microscopic entropy of an isolated system with the Hamilton H, intro- 
duced in Sect. 2.3, which in the present connection we call Gibbs’ entropy Se, 


Sq =—k Tr (log g) (10.6.1) 


does not change in the course of time. The von Neumann equation for this 
system is given by 
6= [0,7] , (10.6.2) 
so that 
—Sq/k = Tr (olog 9)’ = Tr (log 9) + Tré 
= ir ({o, H] log 0) = a (log 0, eo] H) =0. 


We have used Tr @ = 0, which follows from Tre = 1. In this derivation, 
H may be time dependent, e.g. H could contain a time-dependent external 
parameter such as the available volume, fixed by the wall potentials. The 
density matrix varies with time, but 


Sg =0 (10.6.3) 


still holds. The Gibbs entropy remains constant and yields no information 
about the irreversible motion; it has significance only for the equilibrium 
state of the system. 

The derivation of Se = 0 from classical statistics with 


Sq =—k f ar ologo, 


making use of the Liouville equation, is the topic of problem 10.7. This result 
is based on the fact that a particular region in phase space does indeed change, 
but its volume remains the same according to Liouville’s theorem. 


10.6.2 Boltzmann’s Entropy 


To each macrostate?+ M and thus to each microstate X or each point in 
I’-space which represents the macrostate, M = M(X), a Boltzmann entropy 
Sp can be associated: 


24 See Chaps. 1 and 2. 
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Sp(M) = k log ava ; (10.6.4) 


where Iyy is the phase-space region of M and |Iyy| is its volume. 
The Gibbs entropy Sq is defined for an ensemble with the distribution 
function o(X) by 


Selo = -k far o(X) log o( X) . (10.6.5) 


For a microcanonical ensemble (Sect. 2.2), we have 


_ \Cu|~? X€Ely 
moe 0 otherwise 


In this case, 
Sclemc] = klog|Pu| = SB(M) . (10.6.6) 


holds. In equilibrium, the Boltzmann and the Gibbs entropies are thus equal. 
More generally, the two entropies are equal when the particle density, the 
energy density, and the momentum density vary only slowly on a microscopic 
scale, and the system is in equilibrium in every small macroscopic region, 
ie. it is in a state of local equilibrium. When however the system is not in 
complete equilibrium, in which case M and @ would no longer change, then 
the time developments of Sp and Sc, even starting from local equilibrium, 
are quite different. As we have shown, Sc remains constant, while Sg(M) 
changes. Let us consider e.g. the expansion of a gas. Initially, Sg = Sc. 
Then, typically, Sp increases while Sg remains constant, and from Sg alone 
the tendency towards equilibrium is not at all apparent. This is due to the 
fact that the size of the volume of phase space remains the same as in the 
initial state throughout the entire time development. 


*10.6.2.1 Boltzmann’s Calculation of Sg and its Connection with 
the p-Space of the Boltzmann Equation?° 


We consider a dilute gas with N particles and introduce a division of ju-space into 


cells w1,we,... of size |w|, which we enumerate by the index i. Let the cell i be 
occupied by n; particles. The volume in phase space is 
N! N 
| = ———_ 10.6.7 
iS saree ct ( ) 


see below. From this, using Stirling’s approximation, we obtain 


log || & N log N — So ni log ni + N log |w| , (10.6.8) 


25 P. and T. Ehrenfest, op. cit. 2; 


M. Kac, Probability and Related Topics in Science, Interscience Publishers, Lon- 
don, 1953 
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and for Boltzmann’s entropy, 

Sp =klog|l|. 

The relation to the distribution function f and Boltzmann’s H-function: nj; = 
tin d°azd°v f is the number of particles in cell i. In the case that f(x, v) varies slowly 
(it is a smooth function), then ni = |w| x f(in cell 7) = f(in cell 1) = 26 The wi 
are taken to be small; on the other hand, the n; have to be so large that Stirling’s 


approximation may be used. For the H-function which we introduced in connection 
with the Boltzmann equation, we then obtain 


nm 


Aiot = [ee d°v f log f & Son log = So ni logn; — Nlog|w|. (10.6.9) 


|u| 
Comparison with Eq. (10.6.8) for a dilute gas (with negligible interactions) then 
yields 


Sp = —k Atot , (10.6.10) 


apart from the term N log N, which is independent of the configuration. 

Demonstration of the formula (10.6.7) for |I"|: For each point in I’-space, there 
are associated image points of the N molecules in p-space. For example: the kth 
molecule has the image point m*). To each point in I’-space, there corresponds a 
distribution n1,n2,.... On the other hand, to each distribution of states n1,n2,... 
there corresponds a continuum of I’-points. (i) Each of the image points can be 
arbitrarily shifted within the cell w; in which it is located. This gives the volume 
|u|" in IP. (ii) Let a point in P-space be given. Every permutation of the image 
points leads to a I’-point with the same distribution ni,n2,... All together, there 
are N! permutations. (iii) Permutations which only exchange the image points 
within a cell have already been taken into account in the shift permutations (i). For 
every I’-point, there are ni! ng! ... such permutations. The number of permutations 
which lead to new image points (“combinations”) is therefore ar: As we know 
from the discussion of Gibbs’ paradox, the term N log N in (10.6.8) is no longer 
present after division by N!, the “correct Boltzmann counting”, as follows from 
quantum statistics (Bose and Fermi statistics). 


10.7 Irreversibility and Time Reversal 


10.7.1 The Expansion of a Gas 


We now have acquired the necessary fundamentals to be able to discuss the 
expansion of a gas and “Loschmidt’s paradox”, which results from time re- 
versal’. We assume that initially, at time t = 0, the gas occupies only a sub- 
volume V; of the total volume V, and the velocities correspond to a Maxwell 


6 This is certainly fulfilled in local equilibrium. 

27 As an illustration, we refer to a computer experiment with N = 864 atoms, 
which interact via a Lennard—Jones potential (Eq. (5.3.16)); see Fig. 10.1. The 
time development is determined by methods of molecular dynamics, i.e. by nu- 
merical solution of the discretized Newtonian equations (B. Kaufmann, Master’s 
Thesis, TU Miinchen, 1995). The times are given in units of a characteristic time 


/mo2/e = 2.15 x 107” sec for the potential and the mass (argon) considered. 
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distribution. One can imagine that this initial state is produced by removal of 
previously present separating partitions?®. The time which the particles typ- 
ically require in order to pass through the volume ballistically is 7 = £ For 
L=1 cm and v = 10° cm sec™!, we find t9 = 107° sec. In comparison, the 
collision time is very short, T + 107° sec. It thus typically requires several 79 
for the gas to spread out ballistically (with reflections by the walls) within 
the total volume. After about 10 79, the gas is uniformly distributed over 
the whole volume and can be considered to be in equilibrium in macroscopic 
terms. 

In Fig. 10.1, the configurations in the computer simulation?’ are shown 
for the times t = 4.72, 14.16, and 236. This expansion is accompanied by 
a monotonic increase of the “coarse-grained” Boltzmann entropy, which is 
based only on the spatial distribution of the particle-number density, n(x): 


S= - [ n(x, t) log n(x, t) , (10.7.1) 


which attains its equilibrium value after t = 50. This observed behavior is 
exactly as predicted by the Boltzmann equation. The entropy increases mono- 
tonically, cf. Fig. 10.6(1)?9. However, if one reverses all the particle velocities 
in this gas at a particular time t, whatever time has elapsed, then all the 
particles will return along their original paths and after a further time in- 
terval ¢ will again arrive at their initial configuration, and the entropy will 
decrease to its initial value; cf. Fig. 10.6(2). Although the gas represented in 
Fig. 10.1 c) appears completely disordered with respect to its spatial distri- 
bution, and no special features can be seen in its motion along the positive 
direction of time, nevertheless due to the special initial state from which it 
came, which occupies only a fraction of the phase space and is subject to 
a high degree of spatial constraints, it is in a state which contains subtle 
correlations of the particle velocities. Following a time reversal v — —v, the 
particles move in a “conspiratorial” fashion, so that they finally all come back 


28 In the following discussion, we shall denote the initial state in the subvolume Vj 
by X, the microstate after the time t by 7; X, and the time-reversed state by 
TT, X. We have T; T T; X =T X. Here, T; denotes the time development oper- 
ator for the time interval t, and T is the time reversal-operator, which reverses 
all velocities. 

As already emphasized at the end of Sect. 10.6, the Gibbs entropy Sq remains 
constant and gives no indication of the irreversible expansion. This is due to the 
fact that the size of the phase-space volume of the initial states remains constant 
with time. These microstates are however no longer typical of the macrostate 
M(t) (or of the local equilibrium state) which is present for t > 0. The phase 
space of these states is the same size as the phase space at the time t = 0; 
it is thus considerably smaller than that of all the states which represent the 
macrostate at times t > 0. The state 7; X contains complex correlations. The 
typical microstates of M(t) lack these correlations. They become apparent upon 
time reversal. In the forward direction of time, in contrast, the future of such 
atypical microstates is just the same as that of the typical states. 


29 
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Time reversal 


0 50 100 150 200 


Fig. 10.6. The entropy as a function of time in the expansion of a computer 
gas consisting of 864 atoms. In the initial stages, all the curves lie on top of one 
another. (1) The unperturbed expansion of V; to V (solid curve). (2) Time reversal 
at t = 94.4 (dashed curve), the system returns to its initial state and the entropy 
to its initial value. (3) A perturbation * at t = 18.88 and time reversal at t = 30.68. 
The system approaches its initial state closely (dotted curve). (4) A perturbation * 
at t = 59 and time reversal at t = 70.8 (chain curve). Only for a short time after the 
time reversal does the entropy decrease; it then increases towards its equilibrium 
value. 


together within the original subvolume”’. It is apparent that the initial state 
which we defined at the beginning leads in the course of time to a state which 
is not typical of a gas with the density shown in Fig. 10.1 c) and a Maxwell 
distribution. A typical microstate for such a gas would never compress itself 
into a subvolume after a time reversal. States which develop in such a cor- 
related manner and which are not typical will be termed quasi-equilibrium 
states®!, also called local quasi-equilibrium states during the intermediate 
stages of the time development. Quasi-equilibrium states have the property 
that their macroscopic appearance is not invariant under time reversal. Al- 
though these quasi-equilibrium states of isolated systems doubtless exist and 
their time-reversed counterparts can be visualized in the computer experi- 
ment, the latter would seem to have no significance in reality. Thus, why was 
Boltzmann nevertheless correct in his statement that the entropy Sp always 
increases monotonically apart from small fluctuations? 


30 The associated “coarse-grained” Boltzmann entropy (10.7.1) decreases following 
the time reversal, curve (2) in Fig. 10.6. A time dependence of this type is not 
described by the Boltzmann equation and is also never observed in Nature. 

31 J.M. Blatt, An Alternative Approach to the Ergodic Problem, Prog. Theor. 
Phys. 22, 745 (1959) 

32 We must point out an unrealistic feature of the computer experiments here. The 
sample is so small that within an equilibration time 1070 only a few collisions 
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One must first realize that the number of quasi-equilibrium states X which 
belong to a particular macrostate is much smaller than the number of typical 
microstates which represent this macrostate. The phase-space volume of the 
macrostate M with volume V is |Iy7(v)|. In contrast, the phase-space volume 
of the quasi-equilibrium states is equal to the phase-space volume |Iyy(y,)| 
from which they derived by expansion and to which they return after a time 
reversal, and |I’yv,)| < |I'v)|. This means that if one prepares a system 
in a particular macrostate, the microstate which thereby appears will never 
of its own coincidentally or intentionally be one of the time-reversed quasi- 
equilibrium states such as T T; X. The only possibility of generating such an 
atypical state is in fact to allow a gas to expand and then to reverse all the 
particle velocities, i.e. to prepare J T; X. Thus one could refute Loschmidt’s 
paradox by making the laconic remark that in practice (in a real experiment) 
it is not possible to reverse the velocities of 107° particles. There is however an 
additional impossibility which prevents states with decreasing entropy from 
occurring. We have so far not taken into account the fact that in reality, it 
is impossible to produce a totally isolated system. There are always external 
perturbations present, such as radiation, sunspots or the variable gravita- 
tional influence of the surrounding matter. The latter effect is estimated in 
Sect. 10.7.3. If it were in fact possible to reverse all the velocities, the entropy 
would indeed decrease for a short time, but then owing to the external per- 
turbations the system would within a very short time (ca. 10 7) be affected 
in such a way that its entropy would again increase. External perturbations 
transform quasi-equilibrium states into more typical representatives of the 
macrostate. Even though external perturbations may be so weak that they 
play no role in the energy balance, still quantum mechanically they lead to a 
randomization of the phases and classically to small deviations of the trajecto- 
ries, so that the system loses its memory of the initial state after only a small 
number of collisions. This drastic effect of external perturbations is closely 
connected with the sensitive dependence on the initial conditions which is 
well known in classical mechanics and is responsible for the phenomenon of 
deterministic chaos. 

In curves 3 and 4 of Fig. 10.6, the system was perturbed at the times t = 23.6 
and t = 59 and thereafter the time-reversal transformation was carried out. The 
perturbation consisted of a small change in the directions of the particle veloci- 
ties, such as could be produced by energetically negligible gravitational influences 
(Sect. 10.7.3). For the shorter time, the system still closely approaches its original 
initial state and the entropy decreases, but then it again increases. In the case of 
the longer time, a decrease of the entropy occurs for only a brief period. 

It is intuitively clear that every perturbation leads away from the atypical 
region, since the phase space of the typical states is so much larger. This 
also implies that the perturbations are all the more efficient, the closer to 
quasi-equilibrium the state is. For, considered statistically, the imbalance of 


occur. The decrease of the entropy following a perturbation and time reversal 
is due primarily to atoms which have undergone no collisions at all. This is the 
reason for the great difference between curves (3) and (4). 
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the number of typical and of atypical states is then all the greater and the 
probability of a perturbation leading to a much more typical state is higher. 
More significant, however, is the fact that the number of collisions undergone 
by the particles of the system increases enormously with time. Per 79, the 
number of collisions is t/t ~ 10°. And all of these collisions would have 
to be run through in precisely the reverse direction after a time-reversal 
transformation in order for the initial state to again be reached. 


Remarks: 


(i) Stability of the irreversible macroscopic relaxation: the weak perturbations 
investigated have no effect on the macroscopic time development in the future 
direction. The state 7;.X is converted by the perturbation to a more typical state, 
which relaxes further towards equilibrium (quasi-equilibrium) just like T,X. The 
time reversal of such a more typical state leads however to a state exhibiting an 
entropy decrease for at most a short time; thereafter, the entropy again increases. 
(If one conversely first performs the time reversal and then applies the perturbation, 
the effects are still similar.) In the time development T, (T;X ), the particles spread 
apart spatially; in I’-space, X moves into regions with all together a larger phase- 
space volume. This is not changed by external disturbances. For T;/(77T;X), the 
particles move together into a more confined region. All velocities and positions 
must be correlated with one another in order for the improbable initial state to be 
produced once again. In the forward direction, the macroscopic time development 
is stable with respect to perturbations but in the time-reversed direction, it is very 
unstable. 

(ii) Following Boltzmann’s arguments, the explanation of irreversiblilty is prob- 
abilistic. The basic laws of physics are not irreversible, but the initial state of the 
system in an expansion experiment as described above is very peculiar. This ini- 
tial state is quite improbable; this means that it corresponds to only a very small 
volume in phase space and to a correspondingly small entropy. Its time develop- 
ment then leads into regions with a large total volume (and also greater entropy), 
corresponding to a more probable macrostate of the system with a longer dwell 
time. In principle, the system would return to its improbable initial state after an 
unrealistically long time; however, we will never observe this. As soon as the limit 
of infinite particle number is introduced into the theory, this recurrence time in fact 
tends towards infinity. In this limit, there is no eternal return and one has complete 
irreversibility. 

(iii) The significance of the external perturbations for the relaxation towards 
equilibrium instead of only towards quasi-equilibrium goes hand in hand with the 
justification in Chap. 2 of the necessity of describing real systems in terms of sta- 
tistical ensembles. One could also ask what would happen in an idealized strictly 
isolated system. Its microstate would develop in the course of time into a quasi- 
equilibrium state which in terms of its macroscopic behavior would be indistin- 
guishable from the typical microstates of the macrostate formed. In this situation, 
one could for convenience of computation still employ a density matrix instead of 
the single state. 


We have completed the most important considerations of the irreversible 
transition towards equilibrium and the accompanying increase in the Boltz- 
mann entropy. The next sections contain some additional observations and 
numerical estimates. 
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10.7.2 Description of the Expansion Experiment in p-Space 


It is instructive to describe the expansion experiment with an isolated gas 
also in Boltzmann’s p-space, and to compare the Gibbs and the Boltzmann 
entropies in detail; cf. Fig. 10.7. In the initial state, all the gas atoms are in 
the small subvolume Vj. The single-particle distribution function is uniform 
within this volume and vanishes outside it. The particles are to a large extent 
uncorrelated, i.e. the two-particle distribution function obeys 
fo(X1, V1, X2, V2) — f(x1, v1) f(xe, V2) = 0, and higher correlation functions 
vanish as well. During the expansion, f spreads throughout the entire vol- 
ume. As we have already mentioned in connection with the Stosszahlansatz 
for deriving the Boltzmann equation, two colliding particles are correlated af- 
ter their collision (their velocities are such that they would once again collide 
if the gas were subjected to a time reversal = motion reversal). The informa- 
tion contained in the initial state: “all particles in one corner”, i.e. the dis- 
tribution which is concentrated within V; (and thus is spatially constrained) 
shifts to produce subtle correlations of the particles among themselves. The 
longer the time that passes, the more collisions will have occurred, and the 
higher the order of the correlation functions which take on nonzero values. All 
this information is contained in the time-dependent N-particle distribution 
function e(x1,V1,...Xx,Vw,t) on which the Gibbs entropy is based. On the 
other hand, in the Boltzmann entropy only the macroscopic manifestation, 
in the simplest case the single-particle distribution function, is considered. 
The Boltzmann entropy increases. 

The time which is typically required by the particles in order to pass 
through the volume ballistically is 7 = £, For L = 1 cm and v = 10° 
cm sec~!, 7 = 107° sec. In comparison, the collision time tT = 107° sec is 
much shorter. It thus typically requires several 7) for the gas to spread out 


i = f 
o'(x1, P1,---, XN, Pw) = o' (x1, Pi,-.-,XN, Pw) 
S& = SG 
i f 
f'(x, Pp) = f (xP) 
Sp < St 
particles uncorrelated many-body correlations 


Fig. 10.7. Expansion experiment: N-particle and single-particle distribution func- 
tions p and f, the Boltzmann and Gibbs entropies of the initial state i and the final 
state f 
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throughout the whole volume ballistically (with reflections from the walls). 
After around 10 79, about 10%10"° = 10° collisions have also taken place. Even 
after this short time, i.e. 10~* sec, the initial configuration (all the particles 
uncorrelated within a corner of the total volume) has been transferred to 
correlation functions of the order of 10000 particles. 


10.7.3 The Influence of External Perturbations on the 
Trajectories of the Particles 


In the following section, the influence of an external perturbation on the 
relative motion of two particles is estimated, along with the change in the 
collisions which follow such a perturbation. We consider two particles which 
collide with each other and investigate the effect of an additional external 
force on their relative distance and its influence on their trajectories. The 
two atoms are presumed to be initially at a distance ] (mean free path). 
Owing to the spatial variation of the force F, it acts differently on the two 
atoms, AF = F, — F2. The Newtonian equation of motion for the relative 


coordinate Ax = SE leads to Ax & aFi and finally to 
AF? — AF /1\’ 
Ax & xy — (=) : (10.7.2) 
m m \v 


This yields an angular change in the trajectory after a path of length | of 


A AF| 1 
Ad = ial Ex Rasa : (10.7.3) 


l m v2 


Even if this angular change is very small, it will be amplified by the collisions 
which follow. In the first collision, there is a change in the deflection angle of 
Av, = LAY (Fig. 10.8), where it must be kept in mind that | > r,. Here, r. 
is the range of the potentials or the radius of a hard sphere. After k collisions, 
the angular change is 


k 
Ad, = (=) Ad. (10.7.4) 


The condition that the perturbed trajectory have no connection to the un- 
k 
perturbed one is given by AJ; = 2a = (+) Av. It follows from this that 


k= — Ad. (10.7.5) 


We consider for example the influence of an experimenter of mass M = 
80kg at a distance of d = 1m on a sample of helium gas (1 mole) due 


“10.8 Entropy Death or Ordered Structures? 507 


Ad Ad; Fig. 10.8. Av, = + Av 


to the gravitational force, W = — iN, G = 6.67 x 107 mekg~'s~?, 
N =6~x 1073, m = 6.7 x 10-73 g. The additional energy W ~ —2 x 10719 J is 
negligible compared to the total energy of the gas, F + 3kJ. The difference in 
the force on the two particles spaced a distance | apart owing to the additional 
mass M is 


GMm GMm _ GMml 


AF| = x 
ee aae (d+1)? a’ 


and the resulting angular change is 


Ad 


_GMml 1 _ GM (1\? 
~ Bo mv BB (<) , 

For the numerical values given, one finds A’ ~ 4 x 107-78. To determine 
the number of collisions which lead to completely different trajectories, this 
value is inserted into equation (10.7.5) along with | ~ 1400 A andr, # 1.5 A 
resulting in k = 10. In spite of the smallness of AV, due to the logarithmic 
dependence a relatively small number of collisions is sufficient. Even much 
smaller masses at much greater distances lead to a similarly drastic effect. 
Energetically completely negligible perturbations lead to a randomization of 
the trajectory. 


*10.8 Entropy Death or Ordered Structures? 


We close with some qualitative remarks on the consequences of the global 
increase of the entropy. Boltzmann himself considered the evolution of the 
cosmos and feared that it would end in a state of thermal equilibrium (heat 
death). Our Earth and the surrounding cosmos show no signs of this: (i) How 
does the extreme thermal non-equilibrium within the galaxies come about? 
(ii) What allows the existence of ordered and highly organized structures on 
our planet? (iii) Where will further evolution lead? 

In its early period at temperatures above 3000 K, the Universe did not 
consist of galaxies and stars but rather of an ionized and undifferentiated 
soup of matter and radiation. By the time the temperature of the Universe 
had decreased to 3000 K (around 300 000 years after the Big Bang), nucleons 
and electrons began to bind together to form atoms. Matter then became 
transparent to electromagnetic radiation. The radiation, which at that time 
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obeyed a Planck distribution with the temperature 3 000K, can still be ob- 
served today as cosmic background radiation at a temperature of 2.7 K (due 
to the red shift), and it indicates that the Universe was once in equilibrium. 
The decisive effect of the decoupling of radiation and matter was that radi- 
ation pressure was no longer important, so that the gravitational force had 
only to overcome the pressure of matter to form stars. Thus, the stars, stone 
and life forms of today’s Universe could come into being. 

Gas clouds contract as a result of gravitational attraction. In this process, 
their potential energy decreases and due to energy conservation, their kinetic 
energy increases. It follows from the equipartition theorem that they become 
hotter. The hot clouds emit radiation, thus lowering their energy, and con- 
tract more and more, becoming hotter and hotter. This heating up follows 
from the negativity of the specific heat in systems with gravitational interac- 
tions below the gravitational instability®?. This means that with decreasing 
energy their temperature increases. This feature is fundamental for stellar 
evolution. The instability mentioned can also be observed in computer ex- 
periments**. The thermal instability due to gravitation destroys the thermal 
equilibrium and leads to hot clusters, the stars. The temperature differences 
which result permit the formation of ordered structures including life. As 
was already suspected by Boltzmann, on the Earth this is a result of the fact 
that solar radiation is rich in energy and poor in entropy. As was shown in 
Sect. 4.5.4, the entropy of a photon gas is roughly equal to the product of 
the Boltzmann constant and the photon number, and the energy per photon 
is kT. Photons with the thermal energy of the solar surface of ~ 6000 K 
can be split up into 20 photons through processes on the Earth’s surface, 
each with an energy corresponding to 300 K (& temperature of the Earth’s 
surface). One need only recall that the energy of the Earth does not change; 
just as much energy reaches the Earth from solar radiation in the visible re- 
gion as is again reradiated in the form of long-wavelength infrared photons. 
In this process, the entropy of the photons increases by a factor of 20. Even 
if as a result of these processes structures are formed which are ordered and 
have entropies lower than the equilibrium value, the entropy balance still re- 
mains positive, i.e. the total entropy continues to increase. It is in the end the 
thermodynamic instability of gravitational systems which makes life possible. 

In the further long-term evolution (10'° years), stars will collapse after 
they have used up their nuclear fuel, forming neutron stars or, if their masses 
are sufficient, black holes. The phase space of black holes and thus their 
entropies are so large that the ratio of phase space volumes in the final state 
|I;| and in the initial state || will attain a value (according to an estimate 


33 J. Messer, Lecture Notes in Physics 147 (1981); P. Hertel and W. Thirring, 
Ann. Phys. (N.Y.) 63, 520 (1971) 

34H. Posch, H. Narnhofer, and W. Thirring, J. Stat. Phys. 65, 555 (1991); 
Phys. Rev. A 42, 1880 (1990) 
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by Penrose®’) of 
Ps 


= 19010” 
[Ti 


The enormous increase in entropy associated with the gravitational insta- 
bility can be accompanied by local entropy decreases, leaving room for a 
multiplicity of highly organized structures such as gorgons, mermaids, and 
Black Clouds*®. .. 
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Problems for Chapter 10 


10.1 Verify the equations (10.2.11) and (10.2.12). 


10.2 Solve the equation of motion for a chain of atoms (10.2.2) by introducing 
the coordinates ran = /m 44n, and ®an41 = Vm Q(qn — qn41). This leads to the 
equations of motion 
dxn 
dt 
whose solution can be found by comparing with the recursion relations for the Bessel 


functions (see e.g. Abramowitz /Stegun, Handbook of Mathematical Functions). Ref- 
erence: E. Schrédinger, Ann. der Physik 44, 916 (1914). 


= — 2(@n41— Zn-1), 


35 R. Penrose, The Emperor’s New Mind, Oxford Univ. Press, Oxford, 1990, chapter 
7; based on the Bekenstein—Hawking formula under the assumption that the final 
state consists of a single black hole. 

36 F. Hoyle, The Black Cloud, Harper, New York, 1957. 
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10.3 Recurrence time. 


Complete the intermediate steps of the calculation given in 10.3.1 for the recurrence 
time in a chain of N harmonic oscillators. Use the following representation of the 
Bessel function for integral n: 


In(x) = (eu [ov e'* 5? cos nd 


Tv 0 
ra [= looste nn /2—n/4) + O(1/z)| . 


10.4 Calculate the integral which occurs in Eq. (10.3.19), Cie a a4 —z, with 
z=w+in, 7 > 0, by applying the theorem of residues. 


10.5 A microscopic model of Brownian motion. 


(a) Calculate the inverse Green’s function for the model treated in Sect. 10.3 using 
a continuous cutoff function c(k): 


D(z) = M(2 Q) a® pak Ie(k)|? 


m(2z7)3 2? — |ck|? 


\e(k)|? = a ae 
=19 k? + A? z 


(b) Determine the poles of D(z)~' for large values of A. 
(c) Carry out the integration in the solution of the equation of motion 


dw 


eae) = f Pe D(2)" MQ) - 290), 


—oo 


by expanding the residues for large A up to the order O(A~’). Cf. also P. C. Aichel- 
burg and R. Beig, Ann. Phys. 98, 264 (1976). 


10.6 Stochastic forces in a microscopic model of Brownian motion. 
(a) Show that for the model discussed in Sect. 10.3 of a heavy particle coupled 
to a bath of light particles, the heavy particle is subjected to a force F(t) which 


depends on the initial conditions of the light particles and whose half-range Fourier 
transform 


F(z) = / dt e'*' F(t) 
0 
(with z=w+in, 7 > 0) has the form 


1 (0) — izQx(0) 
= ——— c(k) ——, ——- . 
VmN » we = 2 
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(b) Compute the correlation function (F'(t)F(t’)) under the assumption that the 
light particles are in thermal equilibrium at the time t = 0: 


(Qx(0) Que (0)) = Seek, (Qe (0)Que(0)) = Se, “ 


k 


Express the sum over k which occurs in terms of an integral with a cutoff A, as in 
(10.3.18), and assume that the heavy particle couples only to the light particle at 
the origin: c(k) = g. Discuss the correlation function obtained and find a relation 
between its prefactor and the damping constant I’. 


10.7 The time-independence of the Gibbs entropy. 


Let p(p,q) with (p,q) = (p1,..-,p3n,@,---93n) be an arbitrary distribution func- 
tion in phase space. Show using the microscopic equation of motion (Liouville equa- 
tion): 


OH Op | Op OH 


that the Gibbs entropy Sq = —k f dI’plog p is stationary: Sa =0. 
10.8 The urn model?’. 


Consider the following stochastic process: N numbered balls 1,2,...N are divided 
between two urns. In each step, a number between 1 and WN is drawn and the 
corresponding ball is taken out of the urn where it is found and put into the other 
urn. We consider the number n of balls in the first urn as a statistical variable. 

Calculate the conditional probability (transition probability) T;,,,/ of finding n’ 
balls in the first urn if it contained n balls in the preceding step. 


10.9 For Ehrenfest’s urn model defined in problem 10.8, consider the probability 
P(n,t) of finding n balls in the first urn after t steps. 

(a) Can you find an equilibrium distribution P.q(n,t)? Does detailed balance apply? 
(b) How does the conditional probability P(0,no|t,n) behave for t — oo? Discuss 
this result. Hint: the matrix T,,,,/ of transition probabilities per time step has the 
eigenvalues Ay = 1—2k/N, k =0,1,...,N. Furthermore, T,,,, anticommutes with 
a suitably chosen diagonal matrix. (Definition: P(to, no|t,n) = the probability that 
at the time t the first urn contains n balls, under the condition that at time to there 
were no balls in it.) 


10.10 The urn model and the paramagnet. 


The urn model with N balls (problem 10.8 and problem 10.9) can be considered 
as a model for the dynamics of the total magnetization N of non-interacting Ising 
spins. Explain this. 


37 A model for the Boltzmann equation and for irreversibility in which the typical 
behavior can be calculated in a simple manner is the urn model. Although in 
principle one of the urns could fill up at the expense of the other, this path is so 
improbable that the system tends with an overwhelming probability towards the 
state with equipartition of the balls and then exhibits small fluctuations around 
that state. The urn model is analyzed here in a series of problems. 
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10.11 The urn model and the H theorem. 
Let X be the number of balls in urn 1 after t time steps, and 


_ Xi Xe N-Xi, N-X 
VN ON” wn NW.” 


At 


Study the time dependence of H; for a system with Xo = N by carrying out a 
computer simulation. Plot the time development of A; = X:/N — 1/2 for several 
runs of this stochastic process. What do you observe? Discuss the relation between 
your observation and the Second Law. 


10.12 The urn model for large N. 


(a) Calculate the average time dependence of A; from the preceding problem for 
very large values of N. It is expedient to introduce a quasi-continuous time 7 = 
t/N and to consider the quantity f(r) = An-;. Write a difference equation for 
(f(7 + 1/N)) pry=% based on the equation of motion of the probability P,(t) of 
finding n of the N balls in the first urn after the time step t. Taking the limit 
N — oo and averaging over f, you will obtain a differential equation for (f(7)). 
(b) Calculate also the mean square deviation u(r) = (f(r)”) — (f(r))?. What do 
you conclude for the time dependence of the non-averaged quantity f(r)? 
(c) Compare the result obtained with the result of the simulation from the preced- 
ing problem. Explain the connection. 

Reference: A. Martin-Lof, Statistical Mechanics and the Foundation of Ther- 
modynamics, Springer Lecture Notes in Physics 101 (1979). 


10.13 The Fokker—Planck equation and the Langevin equation for the urn model. 


In Ehrenfest’s urn model with N balls, let X; be the number of balls which are in 
the left-hand urn after t steps. Consider the time development of x(7) := VN f(r), 
where f(r) = Xn-,/N — 4 (see problem 10.12). 

(a) Set up the Fokker—Planck equation for P(x,7) by calculating the average and 
mean square jump length (x(7 + #) — 2(T))e(r)=x and ([2(7 + 7) — 2(T)]?)a(r)=2- 
For this, you can use the intermediate results obtained in problem 10.12. 

(b) Do you recognize the equation obtained? Give its solution for P(x,7) by com- 
paring with the case treated in Chap. 8, and read off the results for (f(7)) and 
u(r) = ([f(r) — (f(r))]*) obtained in a different way in problem 10.12 (each under 
the condition that f(r = 0) = fo). 

(c) What is the associated Langevin equation? Interpret the forces which appear. 
Compare the potential which corresponds to the non-stochastic part of the force 
with the Boltzmann entropy Sp(x) = klog|I,|, where I, is the set of microstates 
characterized by x. (Use the fact that the binomial distribution can be approximated 
by a Gaussian distribution for large N.) 


Appendix 


A. Nernst’s Theorem (Third Law) 


A.1 Preliminary Remarks on the Historical Development of 
Nernst’s Theorem 


Based on experimental results', Nernst (1905) originally postulated that 
changes in the entropy AS in isothermal processes (chemical reactions, phase 
transitions, pressure changes or changes in external fields for T = const.) have 
the property 


AS — 0 


in the limit T — 0. This postulate was formulated in a more stringent way 
by Planck, who made the statement S — 0, or, more precisely, 


Bey ont 
where, depending on the physical situation, N is the number of particles or 
of lattice sites. One refers to (A.1) as Nernst’s theorem or the Third Law of 
thermodynamics”. 

According to statistical mechanics, the value of the entropy at absolute 
zero, T = 0, depends on the degeneracy of the ground state. We assume that 
the ground state energy Eo is go—fold degenerate. Let Po be the projection 


! The determination of the entropy as a function of the temperature T is carried 
out by measuring the specific heat Cx (T) in the interval [T>, 7] and integrating 


according to the equation S(T) = So + iB dT a EUS where the value So at the 


initial temperature To is required. Nernst’s Theorem in the form (A.1) states 
that this constant for all systems at T = 0 has the value zero. 

Nernst’s theorem is understandable only in the framework of quantum mechanics. 
The entropy of classical gases and solids does not obey it. Classically, the energy 


bo 


2 
levels would be continuous, e.g. for a harmonic oscillator, E = s (4 + mw?) 


instead of EF = hw (n + z). The entropy of a classical crystal, effectively a system 
of harmonic oscillators, would diverge at T = 0, since per vibrational degree 
of freedom, S = k+klogT. In this sense, Nernst’s theorem can certainly be 
regarded as visionary. 
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operator onto states with E = Eo. Then the density matrix of the canonical 
ensemble can be cast in the form 


e fH a oe e PEx |n\ (n| Be ga ES Sie e A(En— Eo) |r) (n| 


0S Tre-oH yee 7 90 + om sy @ OMe) 
(A.2) 
For T = 0, this leads to p(T = 0) = ae and thus for the entropy to 
S(T = 0) = —k(log p) = klog go . (A.3) 


The general opinion in mathematical physics is that the ground state of 
interacting systems should not be degenerate, or that the degree of degeneracy 
in any case should be considerably less than the number of particles. If go = 
O(1) or even if go = O(N), we find 


lim ———~ =0, (A.4) 


i.e. for such degrees of degeneracy, Nernst’s theorem follows from quantum 
statistics. 

In Sect. A.2, the Third Law is formulated generally taking into account 
the possibility of a residual entropy. This is in practice necessary for the 
following reasons: (i) there are model systems with greater ground-state de- 
generacies (ice, non-interacting magnetic moments); (ii) a very weak lifting 
of the degeneracy might make itself felt only at extremely low temperatures; 
(iii) a disordered metastable state can be ‘frozen in’ by rapid cooling and 
retains a finite residual entropy. We will discuss these situations in the third 
section. 


A.2 Nernst’s Theorem and its Thermodynamic Consequences 


The General Formulation of Nernst’s Theorem: 


S(T = 0)/N is a finite constant which is independent of parameters X such 
as V and P (i.e. the degeneracy does not change with X) and S(T) is finite 
for finite T. 


Results of Nernst’s theorem for the specific heat and other thermodynamic 
derivatives: 


Let A be the thermodynamic state which is attained on increasing the tem- 
perature starting from T = 0 at constant X. From Cx = T (33) $e it follows 
that 


Cx(T) 
ee 


S(T) - S(T =0) = | “OW (A.5) 
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From this, we find furthermore 
Cx(T) —0 for T—0, 


since otherwise S(T) = S(I = 0) +c = oo. This means that the heat 
capacity of every substance at absolute zero tends to zero; in particular, we 
have Cp — 0, Cy — 0, as already found explicitly in Chap. 4 for ideal 
quantum gases. Thus the specific heat at constant pressure takes on the form 


Cp =T*(atbT+...), (A.6) 


where x is a positive exponent. For the entropy, (A.5), one obtains from this 
expression 


S(T) = S(T =0)+T? (+25 +...) (A.7) 


Other thermodynamic derivatives also vanish in the limit T — 0, as one can 
see by combining (A.7) with various thermodynamic relations. 


The thermal expansion coefficient @ and its ratio to the isothermal compress- 
ibility fulfill the relations 


1 /av 1 (aS 
= = f T A. 
. an vlgol: ered ete) 
Qa OP Os 
I ENS. ef Oe for “PSs A. 
Rr (ar), Ge Ee ae on 


The first relation can be seen by taking the derivative of (A.7) with respect 
to pressure 


OV Os a’ UT 
— — = Te t t Pere } A.l 
Oe (sz), (5). (= atl ) ee) 


the second relation is found by taking the derivative of (A.7) with respect 
to V. 
From the ratio of (A.10) and (A.6) we obtain 


Se Sete KT. 


In an adiabatic pressure change, the temperature changes as* dT = (G2) TaP. 
A finite temperature change requires that dP increase as aa Absolute zero 
therefore cannot be reached by an adiabatic expansion . 


3 (22) ae (37) p = 
oT /s 


516 Appendix 


To clarify the question of whether absolute zero can be reached at all, 
we consider the fact that cooling processes always take place between two 
curves with X = const., eg. P = Pi, P = Py (P,; > Pp») (see Fig. A.1). 
Absolute zero could be reached only after infinitely many steps. An adiabatic 
change in X leads to cooling. Thereafter, the entropy must be decreased by 
removing heat; since no still colder heat bath is available, this can be done 
at best for T’ = const. If a substance with a T — S diagram like that shown 
in Fig. A.2 were to exist, i.e. if in contradiction to the Third Law, S(T = 0) 
were to depend upon X, then one could reach absolute zero. 


X=Xq 


adiabatic 
cooling 


S 
Fig. A.1. The approach to absolute Fig. A.2. Hypothetical adiabats 
zero by repeated adiabatic changes which would violate the Third Law 


(e.g. adiabatic expansions) 


A.3 Residual Entropy, Metastability, etc. 


In this section, we shall consider systems which exhibit a residual entropy even 
at very low temperatures, or metastable frozen-in states and other particular 
qualities which can occur in this connection. 


(i) Systems which contain non-coupled spins and are not subject to an ex- 
ternal magnetic field have the partition function Z = (25 + 1) Z’ and the 
free energy F = —kT N log(2S+1)+ F’. The spins then have a finite residual 
entropy even at T = 0: 


S(T =0) = Nklog(2S +1) . 


For example: paraffin, C29H42; owing to the proton spins of H, the partition 
function is proportional to Z ~ 247%, from which we find for the residual 
entropy S = 42kN log 2. 

(ii) Metastable states in molecular crystals: the ground state of crystalline 
carbon monoxide, CO, has a uniformly oriented ordered structure of the linear 
CO molecules. At higher temperatures, the CO molecules are not ordered. If 
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Fig. A.3. The structure of ice* Fig. A.4. Two-dimensional ice: O 


oxygen, @ hydrogen, « other possible 
positions of H 


a sample is cooled to below T = as, where Ae is the very small energy differ- 


ence between the orientations CO-OC and CO-CO of neighboring molecules, 
then the molecules undergo a transition into the ordered equilibrium state. 
Their reorientation time is however very long. The system is in a metastable 
state in which the residual entropy has the value 


S(T =0) = klog2% = Nklog2, 


i.e. S = 5.76 J mol~! K~!. The experimental value is somewhat smaller, 
indicating partial orientation. 

(iii) Binary alloys such as 3-brass, (CuZn), can undergo a transition from a 
completely disordered state to an ordered state when they are cooled slowly. 
This phase transition can also be described by the Ising model, by the way. 
On the other hand, if the cooling is rapid, i.e. if the alloy is quenched, then the 
Cu and Zn atoms stay in their disordered positions. At low temperatures, the 
rate of reordering is so negligibly small that this frozen-in metastable state 
remains permanent. Such a system has a residual entropy. 

(iv) Ice, solid H2O: ice crystallizes in the Wurtzite structure. Each hydrogen 
atom has four oxygen atoms as neighbors (Fig. A.3). Neighboring oxygen 
atoms are connected by hydrogen bonds. The hydrogen atom which forms 
these bonds can assume two different positions between the two oxygen 
atoms (Fig. A.4). Because of the Coulomb repulsion, it is unfavorable for an 
oxygen atom to have more or fewer than two hydrogen atoms as neighbors. 
Thus one restricts the possible configurations of the hydrogen atoms by the 
ice rule: the protons are distributed in such a manner that two are close and 
two are more distant from each oxygen atom. For N lattice sites (N oxygen 


“The structure of common (hexagonal) H2O-ice crystals: S.N. Vinogrado, 
R.H. Linnell, Hydrogen Bonding, p. 201, Van Nostrand Reinhold, New York, 
1971. 

° L. Pauling: J. Am. Chem. Soc., 57, 2680 (1935) 
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atoms), there are 2N hydrogen bonds. The approximate calculation of the 
partition function? at T =0 yields 


N N 
ed ee 
4 16 ay * 


(The number of unhindered positions of the protons in the hydrogen bonds) 
times (reduction factor per lattice site, since of 16 vertices, only 6 are allowed). 


Using W = limy- oo Zoe = 1.5, we find for the entropy per H20O: 
S(T =0) 
——_— =] =log1.5. 
EN og W = log 1.5 


An exactly soluble two-dimensional model to describe the structure of ice has 
been given® (Fig. A.4). A square lattice of oxygen atoms is bound together 
by hydrogen bonds. The near-neighbor structure is the same as in three- 
dimensional ice. The statistical problem of calculating Zp) can be mapped onto 
a vertex model (Fig. A.5). The arrows denote the position of the hydrogen 
bonds. Here, H assumes the position which is closer to the oxygen towards 
which the arrow points. Since each of the four arrows of a vertex can have 
two orientations, there are all together 16 vertices. Because of the ice rule, of 
these 16 vertices only the six shown in Fig. A.5 are allowed. 


ob bE + 


Fig. A.5. The vertices of the two-dimensional ice model which obey the ice rule 
(two hydrogen atoms near and two more distant) 


The statistical problem now consists in determining the number of possibil- 
ities of ordering the 6 vertices in Fig. A.5 on the square lattice. The exact 
solution® of the two-dimensional problem is obtained using the transfer ma- 
trix method (Appendix F.). 


N—-oco 


4\3/2 
W = lim ZY" = (5) = 1.596007... . 


The numerical result for three-dimensional ice is’: 


W = 1.50685 + 0.00015, S(T =0) = 0.8154 + 0.0002 cal/K mole 
Experiment at 10 K: S(T = 0) = 0.82 0.05 cal/K mole . 


° E.H. Lieb, Phys. Rev. Lett. 18, 692 (1967); Phys. Rev. 162, 162 (1967) 
” Review: E.H. Lieb and F. Y. Wu in: Domb and Green, Phase Transitions and 
Critical Phenomena I, p. 331, Academic Press, New York, 1972. 
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The approximate formula of Pauling gives a lower limit for the residual en- 
tropy. 

If the orientations of the hydrogen bonds were allowed to be completely 
unhindered, the residual entropy per lattice site would be log 2? = log 4. Due 
to the ice rule (as a result of the Coulomb repulsion), the residual entropy is 
reduced to log 1.5. If other interactions of the protons were taken into account, 
there would be still finer energy splittings among the various configurations of 
the vertex arrangements. Then, on lowering the temperature, only a smaller 
number would be allowed and presumably at T — 0 no residual entropy would 
be present. The fact that ice has a residual entropy even at low temperatures 
indicates that the reorientation becomes very slow under these conditions. 
(v) The entropy of a system with low-lying energy levels typically shows the 
dependence shown in Fig. A.6. Here, the value of the entropy between T; and 
T> is not the entropy So. In case energy levels of the order of kT, are present, 
these are practically degenerate with the ground state for T >> T;, and only 
for T < T, is the residual entropy (possibly Sp = 0) attained. An example of 
this is a weakly coupled spin system. The plateau in the temperature interval 


Fig. A.6. The entropy of a system with 


Ty 12 T energy levels of the order of kT, and kT2 


(T,, TZ] could appear as a residual entropy on cooling. In this interval, the 
specific heat is zero. In the region of T;, the specific heat again increases with 
decreasing temperature, then drops below JT; towards the value zero after 
the degrees of freedom at the energy kT; are frozen out; this could possibly 
indicate a final decrease of the entropy to its value at T = 0. 

For degrees of freedom with a discrete excitation spectrum (spins in a field, 
harmonic oscillators), the excitation energy determines the temperature be- 
low which the entropy of these degrees of freedom is practically zero. This is 
different for translational degrees of freedom, where the energy levels become 
continuous in the limit N — oo and for example the spacing of the first ex- 
cited state from the ground state is of the order of Suse The corresponding 
excitation temperature of about 5 x 107!°K is however unimportant for the 
region of application of the Third Law, which already applies at considerably 
higher temperatures. The spacing of the energy levels tends towards zero in 
the thermodynamic limit, and they are characterized by a density of states. 
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The temperature dependence of the entropy and the specific heat does not 
depend on the value of individual energy levels, but instead on the form of 
the density of states. For crystals, the density of states of the phonons is 
proportional to the square of the energy and therefore it gives S x T° at low 
temperatures. The density of states of the electrons at the Fermi energy is 
constant, and thus one obtains S « T. 

(vi) It is also interesting to discuss chemical substances which exhibit al- 
lotropy in connection with the Third Law. Two famous examples are carbon, 
with its crystalline forms diamond and graphite, and tin, which crystallizes 
as metallic white tin and as semiconducting grey tin. White tin is the high- 
temperature form and grey tin the low-temperature form. At Ty = 292K, 
grey tin transforms to white tin with a latent heat Q;,. Upon cooling, the 
transformation takes place in the reverse direction, so long as the process oc- 
curs slowly and crystallization seeds of grey tin are present. On rapid cooling, 
white tin remains as a metastable structure. For the entropies of white and 
grey tin, the following relations hold: 


T 
Sw(T) = Sw(0) + i Cw(T) 
fh 
Sa(T) = Sa(0)+ f Ca(C) 


From the general formulation of Nernst’s theorem, it follows that 


since the two forms are present under identical conditions. (Statistical me- 
chanics predicts in addition for these two perfect crystal configurations 
Sw (0) = Sc(0) = 0.) It thus follows that 


T 
Sw(P) — Sq(T) = | aT (Cw(T) — Cc(T)) . 


From this we find in particular that the latent heat at the transition temper- 
ature To is given by 


T dT 
Qu(To) = To(Sw (To) — Se(To)) = T fF (Cw) Col). (Ad 


0 L 
The temperature dependence of the specific heat at very low temperatures 
thus has an influence on the values of the entropy at high temperatures. 

(vii) Systems with continuous internal symmetry, such as the Heisenberg 
model: for both the Heisenberg ferromagnet and the Heisenberg antiferro- 
magnet, owing to the continuous rotational symmetry, the ground state is 
continuously degenerate. Classically, the degree of degeneracy would not, to 
be sure, depend on the number of lattice sites, but it would be infinitely 
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large. For N spin-1/2 sites, in quantum mechanics the z-component of the 
total spin has N + 1 possible orientations. The ground state is thus only 
(N + 1)-fold degenerate (see Eq. (A.4)). This degeneracy thus does not lead 
to a residual entropy at absolute zero. 


Reference: J. Wilks, The Third Law of Thermodynamics, Oxford University Press, 
1961. 
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B.1 The Classical Limit 


We will now discuss the transition from the quantum-mechanical density ma- 
trix to the classical distribution function, beginning with the one-dimensional 
case. At high temperatures and low densities, the results of quantum statis- 
tics merge into those of classical physics (see e.g. Sect. 4.2). The general 
derivation can be carried out by the following method®: 

If we enclose the system in a box of linear dimension L, then the position 
eigenstates |g) and the momentum eigenstates |p) are characterized? by!® 


Glq)=ala) , ala’) =d(q-q'), [ea (q=1, (B.1a) 


Bip) =P\p) .(plp') = opp, do |p) (l= 1, 


eipa/h Qrh 
= —_, withp= —n. (B.1b 
(ap) JE P= (B.1b) 
We associate with each operator A a function? A(p, q), 
A(p, 4) = (p| Ala) (alp) L (B.2a) 


These matrix elements are related to the classical quantities which correspond 
to the operators. For example, an operator of the form A = f(H)g(@) is 
associated with the function 


A(p, q) = (p| f(P) 9 |@) (alp) L = f(p)g(p) - (B.2b) 


8 E. Wigner, Phys. Rev. 40, 749 (1932); G.E. Uhlenbeck, L. Gropper, 
Phys. Rev. 41, 79 (1932); J.G. Kirkwood, Phys. Rev. 44, 31 (1933) and 45, 
116 (1934). 

° For clarity, in this section operators are denoted exceptionally by a ‘hat’. 

10 QM I, Chap. 8 
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The Hamiltonian 


#=H(,4 = +Vv@ (B.3a) 


is thus associated with the classical Hamilton function 


AC ee eee (B.3b) 


2m 


The commutator of two operators is associated with the function 
(p| [A, B] |a) (alp) L 
= L f ad S> { (p| Ala’) (q'|p’) (p'| Bla) — (p| Bla’) (‘lp’) (p'| Alay} 
p' 
x (q|p) 


= 2 [dd (Av. BO.¢) - Be.d)AV'.9)) 


x (plq’) (pla) (q'\p’) (alp) 
(B.3c) 


according to (B.2b), where (p|q’) (q'|p) = = was used. We note at this point 
that for the limiting case of large LD relevant to thermodynamics, the sum- 
mation 


Ee aq [ (B.3d) 


can be replaced by an integral and vice versa. The expression in round brack- 
ets in (B.3c) can be expanded in (q’ — q) and (p’ — p): 


A(p, 7’) B(p", g) — Blip, DA(P', a) = 


(40.0 + (q 05 (a ore.) 
x (20.9 + (p" D> + (0! ws s) 
= (B09 (q ee (a yoo Bs ) 


x (40.0 + (p’ D> (0! p) Op? +...) . (B.3e) 


The zero-order terms cancel, and pure powers of (q’ — q) or (p' — p) yield zero 
on insertion into (B.3c), since the p’-summation and the q’-integration lead 
to a 6-function. The remaining terms up to second order are 
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(p| [A, B] |@) (alp) L = 


=i / ae q)(p' — p) Te = (plq’) (p'|a) (a' |p") (alp) 

(B.3£) 
= L(G (0 P) a) Fo) (ap) 
pla) ; =208 B) 


where the scalar product (B.1b) and Eq. (B.1a) have been inserted. For higher 
powers of (¢ — q) and (jp — p), double and multiple commutators of g and p 
occur, so that, expressed in terms of Poisson brackets (Footnote 4, Sect. 1.3), 
we finally obtain 


(p| (A, B]|q) (alp) L = "tA, BY + O(n’). (B.4) 


Application of the definition (B.2a) and Eq. (B.2b) to the partition func- 
tion leads to 


Zi=Treh# me (ple P29 |p) = Xf ae (ple PMP Iq) (ql) 
x f aa wl ( (pl (e PX Me PV + Of )) Ia) (alp) (B.5) 


pare + O(h = [Fo e PH.) 4 Oh hi) . 


I 


Z is thus — apart from terms of the order of h, which result from commutators 
between K(p) und V(g) — equal to the classical partition integral. In (B.5), 
K = K(p) is the operator for the kinetic energy. 

Starting from the density matrix p, we define the Wigner function: 


p(p,q) = (pla) (al P lp) - (B.6) 


L 
Qh 
Given the normalization of the momentum eigenfunctions, the factor nae is 
introduced in order to guarantee that the Wigner function is independent of 
L for large L. 

The meaning of the Wigner function can be seen from its two important 


properties: 


(1) normalization fa [ap P(p,q) = fax (plq) (al 6 |p) 


=Trp=1. 


(B.7) 


Here, the completeness relation for the position eigenstates, (B.1a), was used. 
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(2) mean values fa [ap p(p, gq) A(p, g) 
= fag S7 = (ola) (al) lAla) (ap) — gg 


= fag (al Ae) | Ala) = Te). 


Pp 


Following the second equals sign, (p|q) (q|p) = ¢ and Eq. (B.3d) were used. 
For the canonical ensemble, we find using (B.5) 


L e-BH 
p(P, 4) = 5-— (pla) (al —— |p) 
L 2 1 
_ ~pR ,—BV = B.9 
aap (nla) (al (ePNe’Y + O(F)) Iv) B (B.9) 
L penn ey) e FH (n.4) 
= aap | (Pla? | za OH = tanz + OW) 
and 
A ot gS dae PPP Alp, a) 
ee + O(h) 
- J dq e— 8H (p,q) 
ees (B.10) 
_ fee PO AG) + Of) 
= f apg e— fH (p,q) : 
The generalization to N particles in three dimensions gives: 
=, : yess) Qn) - (B.11) 
We introduce the following abbreviations for many-body states: 
lq) =|a1)---law) , |p) =|P1)--- |p) , (B.12a) 
eipa/h 
(plp") = Spy» (alP) = Tana ye Ip) (p (B.12b) 


Applying periodic boundary ren the p,; take on the values 


sap (m2, 13) 
nN1, N,N 
Pi = Inf vit? M2) 3 
with integer numbers n,;. 

The many-body states which occur in Nature are either symmetric 
(bosons) or antisymmetric (fermions): 


B. The Classical Limit and Quantum Corrections 525 


1 
lp), = Tm eh P Ip) . (B.13) 


The index s here stands in general for “symmetrization”, and includes sym- 
metrical states (upper sign) and antisymmetrical states (lower sign). This 
sum includes N! terms. It runs over all the permutations P of N objects. For 
fermions, (—1)? = 1 for even permutations and (—1)? = —1 for odd permu- 
tations, while for bosons, (+1)? = 1 always holds. In the case of fermions, 
all of the p; in (B.13) must therefore be different from one another in agree- 
ment with the Pauli principle. In the case of bosons, the same p; can occur; 
therefore, these states are in general not normalized: a normalized state is 
given by 


1 
IP) sn > Spall |p 


where n; is the number of particles with momentum p;. We have 


Neg (B.14) 


n / x ny! ng!... A 
TrA= S- sn (p| A|p) sn = S- ee (p| A |p) sn 


P1;---,.PN P1;---, PN 
; (B.15) 
Piy-sPN 


The prime on the sum indicates that it is limited to different states. For 
example, pyp2... and p2p;... would give the same state. Rewriting the 
partition function in terms of the correspondence (B.2b) yields 


= 1 _pH 
Z= Tre Ph = — D7 a tple Ip) 


{pi} 
1 
= a f Oh «le Ia) (ale) (B.16) 
{pi} 


1 V 3N 3N —BH (p,q) 2 
ac @nhp dp | d’ qe | (q|p), |) + OCA) . 


The last factor in the integrand has the form | (g|p),| = V-“(1+ f (p,q), 
where the first term leads to the partition integral 


PN p BN g 


Namen e PP + O18). (B.16’) 


Remarks: 


(i) In (B.16), the rearrangement «(Pl e BH Ip), = f BX as (ple BK @—BV \q) 
x (gp), + O(h) = f (BN qo BH. | (q\p), 2. +O(h) was employed, where 
the symmetry of Hf under particle exchange enters. 
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(ii) The quantity | (g|p), |? = V-“ (1 + f(p,q)) contains, in addition to the 
leading term V~‘ in the classical limit, also p- and q-dependent terms. 
The corrections due to symmetrization yield contributions of the order 
of h®. Cf. the ideal gas and Sect. B.2. 

(iii) Analogously (to B.16), one can show that the distribution function is 


e— 8H(p.9) 
pP(p, q) ~~ Z(27h)3N N! - 


We have thus shown that, neglecting terms of the order of A, which result 
from the non-commutativity of the kinetic and the potential energies and the 
symmetrization of the wave functions, the classical partition integral (B.16’) 
is obtained. 

The classical partition integral (B.16’) shows some features which indicate 
the underlying quantum nature: the factors 1/N! and (27h)~3. The first of 
these expresses the fact that states of identical particles which are converted 
into one another by particle exchange must be counted only once. This fac- 
tor makes the thermodynamic potentials extensive and eliminates the Gibbs 
paradox which we discuss following Eq. (2.2.3). The factor (27h)~?% renders 
the partition integral dimensionless and has the intuitively clear interpreta- 
tion that in phase space, each volume element (27h)? corresponds to one 
state, in agreement with the uncertainty relation. 


(B.17) 


B.2 Calculation of the Quantum-Mechanical Corrections 


We now come to the calculation of the quantum-mechanical corrections to 
the classical thermodynamic quantities. These arise from two sources: 

a) The symmetrization of the wave function 

b) the noncommutativity of K and V. 

We will investigate these effects separately; their combination yields correc- 
tions of higher order in h. 

a) We first calculate the quantity | (q|ps) |?, which occurs in the second line 
of (B.16), inserting Eq. (B.13): 


| (ap). |? = a SSS) (-1)” (al P’ Ip) (al P |p)” 


PP? 
= a DLV HP? (P'alp) (Palpy" 
P Pp! 
~ wD)” (alp) (PP’~*alp)” (B.18) 
‘Pp Pr 
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Here, in the second line, we have used the fact that the permutation of the 
particles in configuration space is equivalent to the permutation of their spa- 
tial coordinates. In the third line we have made use of the fact that we can 
rename the coordinates within the integral [ d?“q which occurs in (B.16), 
replacing P’q by q. In the next-to-last line, we have used the general prop- 
erty of groups that for any fixed P’, the elements PP’~' run through all 
the elements of the group. Finally, in the last line, the explicit form of the 
momentum eigenfunctions in their configuration-space representation was in- 
serted. 

Inserting the final result of Eq. (B.18) into (B.16), we can express each of 
the momentum integrals in terms of 


3 — 8P" + ipx co 3 — 8p* 
d’pe = | d’pe f(x), (B.19) 
with 
ax2 
f(x)=e 7 , (B.20) 
where \ = aot [Eq. (2.7.20)] is the thermal wavelength. Then we find 


for the partition function, without quantum corrections which result from 
non-commutatitivity, 


3N 3N 
aga" P | pH(p,a) 


= | Nianene S\(-))? f(ai-Pan)--. f(av—Paw) . (B.21) 


P 


The sum over the N! permutations contains the contribution f(0)% = 1 for 
the unit element P = 1; for transpositions (in which only pairs of parti- 
cles i and j are exchanged), it contains the contribution (f(qi — q;))*, etc. 
Arranging the terms according to increasing number of exchanges, we have 


So-D fa — Paqi)::: flaw — Pan) = 


y 


I 


ED (F(a - aj) + S> f(ai — ay) fay — ae) (Ge — ai) £-.- 
i<j ijk 
(B.22) 


The upper sign refers to bosons, the lower to fermions. For sufficiently high 
temperatures, so that the average spacing between the particles obeys the 
inequality (v is the specific volume) 


v3 > d, (B.23) 
we find that f(q; — q,;) is vanishingly small for |q; — q;| >> A, and therefore 


only the first term in (B.22) is significant; according to the preceding section, 
it just yields the classical partition integral, (B.16). 
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The more factors f that are present in (B.22), the stronger the constraints 
on the spatial integration region in (B.16). The leading quantum correction 
therefore comes from the second sum in (B.22), which we can rewrite in the 
following approximate way: 


1+ "(f(ai—ay))” S [[a + (f(ai—4y))”) = eB Lig; Bi (ai—9y) | (B.24) 


t<j 1<j 


Here, the effective potential 


(ae — aj) = —T log (1 +e“ 27I4-asl/”) (B.25) 


is attractive for bosons and repulsive for fermions. This effective potential 
arises from the symmetry properties of the wave function and not from any 
microscopic mutual interaction of the particles. It permits us to take the 
leading quantum correction into account within the classical partition inte- 
gral. For the ideal gas, these quantum corrections lead to contributions of the 
order of f° in the thermodynamic quantities, as we have seen in Sect. 4.2. 


b) The exact quantum-mechanical expression for the partition function is 
given by 


1 ge 
ZH > s (ple BH Ip) 
{pi} (B.26) 


= x (ais) fore far s (ple ?* |q) (alp) s - 


If we neglect exchange effects (symmetrization of the wave function), we 
obtain 


Z= a (aos) [arr [are (pj e~9# |a) (alp) 


= a (ais) [ore fran, 


To compute the integrands which occur in this expression, we introduce the 
following relation, initially for a single particle, 


(B.27) 


T= (ple~* |q) (q|p) V = ei? Me PH val (B.28) 


After the last equals sign and in the following, H denotes the Hamiltonian in 
the coordinate representation. To calculate I, we derive a differential equation 
for J using the Baker—Hausdorff formula: 
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— —eipa/h Ffe— BH e—ipa/h _ __pipa/h FF Q—ipa/hy 
+ .[ Pa 4 i ; 
= (a i| =H] Re (pq, (pq, H]] tea} (B.29) 


hn 20 op? 
p I}. 
2m h° Oq fh? 
The higher-order commutators (indicated by dots) vanish, so that 


al he ( % 0 & 
sy > |-Hoo+ = (-Fog + sa) ]t. (B29 


lI 
=> 


where H(p,q) is the classical Hamilton function. To solve this differential 
equation, we use the ansatz: 


y = FH.) f (B.30) 


We find the following differential equation for x from (B.29’): 


Ox ol gen es eo 
CA _ A BH (p,q) ~~ — oH (p.a) T 
gp NO ag Oe Pog oF 
he (2i ao @ 
— ebH(p.9) BH (p,q) 
a (Feast 5a) « 
_WB[2ipaV Ap da _ AV | (av)? ve) 
~ ImI|h dq hBdq Aq Oq 
OOD 5 pO 
dq Oq © B Oq? Me 


Transferring to a many-body system with the coordinates and momenta q; 
and p; yields 


OB 4“2m;| h OG hB OG Ogi? 


OV\? ava a? 
2 + Zot . (B31 
3(5r) Ogi OF: id Oqi? A ( : ) 


Ox y 28 2 AV ip, 0. PV 


4 


The solution of this equation is obtained with the aid of a power series ex- 
pansion in hi: 


X= 1t+hy t+ hx. + O(F*) . (B.32) 


Because of (B.28) and (B.30), y must obey the boundary condition . = 1 for 
G3 = 0. Inserting this ansatz into (B.31’), we obtain 
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OxA pi O 
ONE a2 os B.33 
7a es = con 
and 
ee 7 Bs : ipa x oF dip, A - poh + + 3? ; : (B.33b) 
From this, it follows that 
2 
es ora ua (B.34a) 
My on 
zs pi O B Di Dk OPV 
aS » m; O a » 7 ms Mr OVO 
(oka 1 (OV 3? 1 0V 
: B.34b 
Inserting (B.30) and (B.27), we finally obtain the partition function 
db gq PN p BH 
= i ( ’ ) 2 
- | mee PD(1+ hxi + he x2) . (B.35) 


The term of order O(h) vanishes, since 1 is an odd function of p;, so that 
the remaining expression is 


Z = (14h? (x2)q)Za - (B.36) 


Here, ( ),] refers to the average value with the classical distribution function, 
and Z,) is the classical partition function. From it, we thus obtain for the 
free energy 


1 1 he 
F — gee — Fy oa gest + A? (x2)¢1) ~~ Fy —_ Bg veel ‘ (B.37) 
With 
m 
(DiPk) cl = ZSik (B.38) 
B 
and 


(aie) °( (3) ) 


(proof via partial integration), it follows that 


h2 av \? 
P= Fat sam(eDe 2 (Cae i 
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The classical approximation is therefore best at high T and large m. 


Remark: Using the thermal wavelength \ = 27h/W2mamkT and the length | which 
characterizes the spatial variation of the potential (range of the interaction poten- 
d2 v2 


tials), the correction in Eq. (B.39) becomes 35 {> - This gives as a condition for the 
validity of the classical approximation 

A<l (from the non-commutativity of K and V) (B.39a) 
and, according to Eq. (B.23) 

v\Vs 

NK (F) (from symmetrization of the wave function) . (B.39b) 

Rearranging Eq. (2.7.20), one gets 
5 x 107* 5.56 x 10° 


fli [em2}m[g] d2[A7Im[me] 


For electrons in solids, we have (x) oe 


5.5 x 10° K, their behavior remains nonclassical. 
For a gas with the mass number A: m= A: mp, (Rye = 107-'cm, T & 


~ 1A, so that even at a temperature of T = 


B.3 Quantum Corrections to the Second Virial Coefficient B(T) 
B.31 Quantum Corrections Due to Exchange Effects 


We neglect the interactions; however, the second virial coefficient from 
Eq. (5.3.7) 


(B.40) 


is still nonzero due to exchange effects. A two-particle eigenstate has the form 


bbayba) = salle lps) + Iba) [ba)) for bn a 
(B.41) 
\p1) \p1) bosons 


|p1,p2) = for pi = p2 
0 fermions . 


The partition function for two non-interacting particles is 


2 
5 en? /mkT 
Zo = Tre Pitb2)/2mkT _ u S : 7 (Pi+P2)/2mkT 4 J 


ie 0 
1 eee. 1 2 
Paha —(pi+p3)/2mkT 4 = —p* /mkT 
Epona 
1 p2 


1 1 2 bosons 
= Z77+= e P /mkP fo : 
Dob ye fermions e) 
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From this!!, we find for the second virial coefficient (5.3.7): 


re 2 3 
—— —p'/mkT _ — 
BS toy 2 = ape 
Pp 
1 /( nt? \3? bosons 
SAS fh fo Fi B.43 
2 (=) J fermions ( 


B.32 Quantum-Mechanical Corrections to B(T) Due to 
Interactions 


In the semiclassical limit (unsymmetrized wave functions), from Eq. (B.35) 
we obtain for the partition function of two particles 


Ifo 
Zo = = (55) [ae Pars eo Maley EAA +4 Ax4 +h? x2) : (B.44) 
2\X Nee 


This leads to the following expression for the second virial coefficient ((5.3.7), 
(B.40)): 


1 
a” (7 / Pay Pars (errs + Px) -1)) ¢ HE 


The quantum correction is therefore given by 


2 
a dy a= Ee a Ov h? 
Bam = Oy) 24m(kT)? 


oe | re —v(r)/kT Ov : 
= ia aa Ory. 


where in the second line we have assumed a central potential. This quantum 
correction adds to the classical value of B; it is always positive. The exchange 
corrections (B.43) are of the order O(h?). The lowest-order quantum correc- 
tions, i.e. (B.46), are of order h?. At low temperatures, these quantum effects 
(due to non-commuting V and K) become important. The contribution from 
symmetrization is relatively small. 


(B.46) 


B.33 The Second Virial Coefficient and the Scattering Phase 


One can also represent the second virial coefficient in terms of the phase shift 
of the interaction potential. The starting point is the formula for the virial 


11 a oF e-P 2/2mkT _ = 


We do not take the spin degeneracy factor g = 25 + 1 into account here. 
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coefficient, Eq. (5.3.7) 
Z 1 
B=-(3-35] V. (B.47) 


The interaction does not appear in the partition function for a single particle 


p? V p V 
Z1 = ~2mkT = ——__ | Bn e FmkT = — B.48 
—— any | ae ¥ ey 
The Hamiltonian for two particles is given by 
2 2 
A= Pit P2 1 V(x, — xp) (B.49) 
2m 


and, introducing coordinates for the center of mass (CM) and the relative 
position (r): 


1 
XoM = pics! +2), Xp =X2—-X1, (B.50) 
it can be written as 
+ pe p; 
He==OM i254 V(x,). (B.51) 
m m 


Then the partition function for two particles becomes 
2 p? 
—( =24V(x,) } /kT V En 
Zy= Treye For Try é ( ))/ = a Soe. (B.52) 


In this expression, €, denotes the energy levels of the two-particle system in 
relative coordinates taking into account the different symmetries of bosons 
and fermions. It leads to 


B= ee ee - =) ; (B.53) 


We now remind the reader that for non-interacting particles, (B.43) gives for 
the second virial coefficient 


V bosons 
BO = — (29233 Sen kT >) = 1275/23 . (B.54 
d. © 2 fermions ( ) 


The change in the second virial coefficient due to the interactions of the 
particles is thus given by 


B(T) — B°(T) = -29/23 (Cos = e Aen ) (B.55) 


n 
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The energy levels of the non-interacting system are 


2].2 
ee a (B.56a) 
m 


while in the interacting system, along with the continuum states of energy 


272 
geen (B.56b) 
m 


bonding states of energy ¢g can also occur. The values of & are found from 
the boundary conditions and are different for the interacting system and for 
the free system, so that also different densities of states are obtained. The 
number of energy levels g(k)dk in the interval [k, & + dk] defines the density 
of states, g(k). We thus find 


B(T) — BO(T) 


ot eee + / dk (g(k) — g (k))e~£*/*F | | (B.57) 
Bs 0 


The change in the density of states which occurs here can be related to the 
derivative of the scattering phase. We assume that the potential V(r) has 
rotational symmetry and consider the eigenstates of the relative part of the 
Hamiltonian. Then we can represent the wave functions for the free and the 
interacting problem in the form!” 


fi) Ap, Vile) Re @) 
Wrim(X) = AkimYim(V, 9) Rai(r) « 


The free radial functions are given in terms of spherical Bessel functions. The 
asymptotic forms for r — co are 


(B.58) 


1 
RO (r) ae sin( kr + =) 


(B.59) 
Ru(r) = ~ sin(kr see a 61()) 


with the phase shifts known from scattering theory, 6;(k). The allowed values 
of k are found from the boundary conditions 


RY? (R) = Ru(R) = 0 (B.60) 
at a large radius R (which finally goes to infinity). From this it follows that 


l l 
kR+ = an and kR+ > +6i(k) =a, (B.61) 


12 QM I, Chap. 17 
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where n = 0,1,2,.... The values of k therefore depend upon /. Neighboring 
values of k for fixed / differ by 


TT 


O61(k) * 


2 a (B.62) 


AK = = and Ak 


We still must take into account the fact that every value of | occurs with a 
multiplicity of (21+ 1). Since each interval Ak or Ak contains a value of 
k, the densities of states are 


21+ 
SEER Gia GG) 


gi (k) = = (B.63) 


et 36,(k) 
= HD fy OB] 


From this the second virial coefficient follows: 
B(T) — BO (T) 


~ 2872931 enen/kT y 1 i dkS~ fi(2l + 1) AU) mer \ 
Tv 
B 0 l 


(B.64) 


Now we need to determine the values of | which are allowed by the symmetry 
properties. For bosons, we have u(—x) = (x), and for spin-1/2 fermions, 
w(—x) = +u(x), depending on whether a spin singlet or a triplet state is 
considered. For spin-0 bosons, we thus have | = 0,2,4,... and f; = 1. For 
spin-1/2 fermions, 


1=0,2,4,... =1 singlet 
fi ( e ) (B.65) 
b= 1,355 50:. fi=3 (triplet) . 
The change in the second virial coefficient is expressed in terms of the binding 
energies and the phase shifts. An important contribution to the k-integral 
comes from the resonances. Very sharp resonances have 26 (k) = 70(k — ko), 
and one obtains a similar contribution to that of the bonding states, however 
with positive energy. More generally, one can interpret the quantity 
100(k) OE 06, O61 


A Ok  ORROE OB’ 


as velocity times the dwell time!® in the potential. The shorter the dwell time 
within the potential, the more nearly ideal is the interacting gas. 
Literature: 

S.K. Ma, Statistical Mechanics, Sect. 14.3, World Scientific, Singapore, 1985 


E. Beth and G. E. Uhlenbeck, Physics 4, 915 (1937) 
A. Pais and G.E. Uhlenbeck, Phys. Rev. 116, 250 (1959) 


13 See e.g. QM I, Eq. (3.126) 
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Problems for Appendix B.: 


B.1 Carry out in detail the rearrangements which occur in Eq. (B.3f). 

B.2 Carry out the rearrangement in Eq. (B.28). 

B.3 Show that (B.29’) follows from (B.29). 

B.4 Determine the behavior of the effective potentials 0(x) in Eq. (B.25) for small 
and large distances. Plot #(x) for bosons and fermions. 


C. The Perturbation Expansion 


For the calculation of susceptibilities and in other problems in which the 
Hamiltonian H = Ho + V is composed of an “unperturbed” part and a 
perturbation V, we require the relation 


1 
efot+V — eHo +f dtefHoyell-Ho + O(V?) (C.1) 
0 


To prove this relation, we introduce the definition 
A(t) = e% te Hot 

and take its time derivative 
A(t) = e!*(H — Ho)e~ Hot = ce *'VeW Bot - 


By integrating over time between 0 and 1, 

A(1) — A(0) = e%e-#o —1 = [ aeetverte ; 

0 

we obtain after multiplication by e° the exact identity 

ef — eMo 4 [aectrvel-nm (C.2) 

0 

Expanding e#! = e(#0+V)t in a power series, we obtain the assertion (C.1). 
Iteration of the likewise exact identity which follows from (C.2), 

eft — Mot + [aw eH Velt-*)Ho (C.2') 

0 

yields 


1 
ef — eMo +f dt eMotVel—) Hoy 
0 


1 t 
‘h a dt! efot’ Velt-t’) Hoy (1-4) Ho +e..dt 


x rate t1) Vt lee, 


(C.3) 
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With the substitution 
1 — tp = Un, 1 — tn-1 =Un-1,---,l- = U1 


we obtain 


oe) 1 A: 1 
n=179 U1 Un=1 


x Velt2 41) Ho petite | 
(C.3’) 


D. The Riemann ¢-Function and the Bernoulli Numbers 


In dealing with fermions, the following integrals occur: 


: fa weed 
T(v) Ct a a kv 
‘ z 


After the last equals sign, the Riemann ¢-function 
1 
¢(v) = » a for Rev > 1 (D.2) 


was introduced. The integrals which occur for bosons can also be related 
directly to it: 


1 i qv} oo 1 
I(v) = e*—1_ by pe (D.3) 


According to the theorem of residues, ¢(v) can be written in the following 
manner: 
1 t 1 
Oe fae TZ fae. (D.4) 
Ag 


4i 4 zy 
C Cc’ 


Definition of the Bernoulli numbers: 


ae yB sea (D.5) 
—zcot=z=1- na ; 
rea a" Qn)! 
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Fig. D.1. The integration 
path in (D.4) 


1 1 1 
1 ’ 2 30 ’ 3 49 ’ 
vy =2k 
= = °° (2z3)?” 
c(ak) = f ag Lent Be “Gor __ (2 )*B (D.6) 
Ai 2k rz 2(2k)! ? 
(Ou 


since only the term n = k makes a nonzero contribution. 


1 i g2k-l (2ek—2 _ 1) nw? By, 


Tek! J ee +1 (2k)! 
love) g2k-1 art _ 1) 17k By 
d = 
i verti 2k (D7) 
ete ck (20)?*Br — (20)?* By 
= (2k -1)! = 
| apiece pom 4k pe) 


EK. Derivation of the Ginzburg—Landau Functional 


For clarity, to carry out the derivation we first consider a system of ferromag- 
netic Ising spins (n = 1), which are described by the Hamiltonian 


1 
HF I~ NSS — HDS an 


Lv 


where 5} takes on the values S; = +1. We assume a d-dimensional, simple 
cubic lattice; its lattice constant is taken to be ag and the side length of the 
crystal to be L. This d-dimensional lattice is then divided into cells of volume 
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Beye 
a, 


Fig. E.1. Division of the lattice into cells 


v = a‘, whereby the linear dimensions of the cells a, are assumed to fulfill 
the inequality a9 < a. < L. The number of cells is N. = (2 = %, and 


the number of lattice points within a cell is N= eae Finally, we define 
the cell spin of cell v: 


1 
my ==> Si, (E.2) 
N lev 


whose range of values lies in the interval —1 < m, < 1. We now define a new, 
effective Hamiltonian F({m,}) for the cell spins by carrying out the exact 
rearrangement 


—BH — —BH —BH 
pane = eT bp, sim) 
{S;=+1} {mv} Vv 


E.3 
= S- e BF ({mv}) ee) 
{m} 
which corresponds to a partial evaluation of the trace, i.e. 
1 S 
Fi({mv}) = “3 log S- oe bH IL és.., kt, + (E.4) 
{S;=+1} v 


For sufficiently many spins per cell, m, becomes a continuous variable 
(Am, = 2) 


Dee fam. (E.5) 


F({m,}) = Nf ({m,}) for sufficiently large N . 
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The field term expressed in the new variables is 


—hS 8 =-hS OS IS =-ANS om . (E.6) 
I Vy 


v lev 


Thus, the factor e~? 2151 = e~"N 2X, ™ is not affected at all by the trace 
operation after the third equals sign in (E.3), and is transferred unchanged 
to F({m,}). This has the important effect that all the remaining terms in 
F({m,}) are independent of h, and due to the invariance of the exchange 
Hamiltonian (see Chap. 6) under the transformation {S;} — {—5/}, they are 
also even functions of the my. 

We can decompose f({m,}) into terms which depend only on one, two, 


Nz 
FLm}) = Am) +5 OE Omsm) to (E.7) 
v=1 PAV 


The Taylor expansion of the functions in (E.7) is given by 


filmy) = fi(0) + com? + cam$ +...— hm, (E.8a) 
and 
fo(my, Mp) = - Se 2KhyvMypMy +... . (E.8b) 
LV 


It then follows from (E.3) and (E.5) that 
rae ie < 
7 I a a . dm,e~PNIme}) (E.80) 
with 
b 
Fllme}) = Neful0) +3 (amd + Fmt +... — hm) 
1 
+ 52 Kuv(m,— my) +... (E.8d) 
Lv 


The coefficients f;(0),a,b and K,, are functions of T and Jy. The cells, like 
the original lattice, form a simple cubic lattice, which is the cell lattice with 
lattice constants a, and lattice vectors a,. Let N; be the number of lattice 
points (cells) in the direction i, whose product N;N2N3 must give N.; we 
then define the wavevectors with components 

207; 


Ni Ni 
ky = Neu. , where — oe <7 < 5: 


(E.9) 
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The reciprocal lattice vectors for the cell lattice are given by 
27 
g= 7 (ta aera) . (E.10) 


z 


The Fourier transform of the cell spins is introduced via 


1 . 
My = Ds efkav my (E.11a) 


1 ‘ 
my =— Siem ‘ (E.11b) 
The orthogonality and completeness relations of the Fourier coefficients 
are 
Soerer ey = Nodypy! (E.12a) 
k 


: j 1 fork—k’= 
yee jay — N.A(k _ k’) = N. OF : 8 ; (E.12b) 
7 O otherwise 


where g is an arbitrary vector of the reciprocal lattice. 


The transformation of the individual terms of the free energy is given by 


a ; m> =a ; MkMN_k 
Vv k 

b 4 b i(ki+...+k4)ay 
) my = 172 ; ) e Mk, +++ Mk, 
D c 


ie (E.13a) 


b 
= > A(ki est ka) Mx, MK, Mk3Mky ‘ 


hNS> my = AN\/Nemx=o 


Due to translational invariance, the interaction K,,, depends only on the 
separation, 


1 2 
3 Ke vim, = m4") 


. ON. ys » S> K (60's (a —, et25 me tka (1 _ e885) myer 
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Here, 6 =v —v’' was introduced, and 
k) = © K(5)(1 — coskas) = oes K (6) 2 sin? — (E.13b) 
5 


was defined. Due to the short range of the interaction coefficients (6), we 
can expand sin? 3s for small k in a Taylor series, and terminate the series 


after the first term. Taking the cubic symmetry into account, in d dimensions 
we find 


v(k) = 1g Kast +O(k*) . (E.13c) 


Then the partition function in Fourier space is 


Z=Zp (IL fem) exp — BN bx + ck?)mym_x 


k 


eR 


we 
TON, S- A(ki +.. -+k4) mu, ...m te — nV Nemo +-..] 


k,.. ka 
(E.14) 


where Zo is the part of the partition function which is independent of mx, as 
follows from (E.8c). 


Definition of [ dmx : 


Due to mi = m_x, (E.11a) can be written in the form 


My = d ye (ene (Rem, + tlm my) 


+e7"**” (Rem — iImmx)) 
etka tka, 
a EEE Fe E.15 
yf ee Waren) (E.15a) 
VN wm, ( v2 =. 
ikay __ ,—ikay 
+i SS (VEImmy)), 
J/2 eee, 
Y-k 


where the sums over k extend only over half of the k-space (HS). This is an 
orthogonal transformation 


sS- (ers 4 e kav) (on ce ae) x = Ok, k’ : (E.15b) 


Vv 
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and correspondingly for sinka,. The cross terms give zero. It follows that 


[Tlom.= J IL (V2dRe mx) (V2dIm mx) «. = f Tam. 


keHSs 
(E.15c) 
Clearly, from (E.13b), 
u(k) = v(—k) = v(k)* (E.15d) 
and 
S/ o(k)m_m_x = SS v(k) ((v2Re mx)” + (v2Im mx) ) 


k keHSs 


= Sov(k)yz .  (E.15e) 
k 


In the harmonic approximation, it follows from (E.15d), as will be verified in 
(7.4.47), that 


x U(K) [74/7 Oy 
(M_EM«!) = /(U amy) — mei = uk) F (E.16) 


Continuum limit v = at 0. 


If we consider wavelengths which are large compared to az, we take the 
continuum limit: 


my (E.17) 


m(x,) = 


* 


m(x) = 


eX my . (E.18) 
\/ N. = a. 


In the strict continuum limit, the Brillouin zone goes to oo. The terms in the 
Ginzburg-Landau functional are 


dm = jes m(x)? , ae: = v fate m(x)* , 
Dee = =, fas m(x) , 2 Fem = [ ate(vm()? 
[Tem = [ Plmool = [TI (vance) 


The functional integrals are defined by discretization. Then as our result for 
the Ginzburg-Landau functional, we can write 


(E.19) 


544 Appendix 


Fl[m(x)] = pee am + sm*(x) + c(Vm)” — hm(x) +...|  (E.20) 


which yields the partition function in terms of the following functional inte- 
gral: 


Z = Z)(T) i D[m(x)]e~ FF Ir] (E.21) 


(i) Here, we have redefined the coefficients once again; e.g. 77 was combined 
with h. The coefficient Zo(T) is found from the prefactors defined earlier, but 
it is not important in what follows. 

(ii) Owing to the fact that the trace is only partially evaluated, the coeffi- 
cients a,b,c and Zo(T) are “uncritical”, ie. they are not singular in T,J,... 
etc. 

(iii) In the following, we extend the integration range for (s dmy... = 
ee dm(x) > f°. dm(x), since m(x) is in any case limited by e-om” Tits 
most probable value is m(x) ~ \/>", and thus m, ~ /v\/¢ <1. 


(iv) General statements about the coefficients in the Ginzburg-Landau func- 
tional: 

a)F|m(x)] has the same symmetry as the microscopic spin Hamiltonian; 
i.e. except for the term with h, F[m(x)] is an even function of m(x). 

3) From the preceding rearrangement of the h term, it may be seen that 
a,b,c are independent of h. In particular, the partial evaluation of the trace 
produces no higher odd terms. 

7) Stability requires b > 0. Otherwise, one cannot terminate at m+. At the 
tricritical point, b = 0 and one must take the term of order m® into account, 
also. 

6) The ferromagnetic interaction favors parallel spins, ie. nonuniformity of 
spin direction costs energy. Thus cVmVm with c > 0. 

€) Concerning the temperature dependence of the G.-L. coefficient a, we refer 
to the main part of the text, Eq. (7.4.8). 

(v) In the a limit, the linear dimension L — ov, 


(x) “we “= Za en 


keB 


and m(x ees —> ff. BY e’k*m(k), where the integral is extended over the 
whole seh zone fs ky € [- aS =], and m(k) = L4/2m x. 


Later, the integral over the cubic Brillouin zone will be approximated by an 
integral over a sphere: 
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F. The Transfer Matrix Method 


The transfer matrix method is an important tool for the exact solution of 
statistical-mechanical models. It is particularly useful for two-dimensional 
and one-dimensional models. We introduce the transfer matrix method by 
solving the one-dimensional Ising model. The one-dimensional Ising model 
for N spins with interactions between nearest neighbors is described by the 
Hamiltonian 


N N 
H=—-JS oj;0;41-HY o;, (F.1) 
j=l g=l 


where periodic boundary conditions are assumed, ony 41 = 01. The partition 
function (K = BJ, h = GH) has the form 


N 
Give 3 [Lek itt t Bes tesey) =Tr (E") . (F.2) 
{o,=+1} j=1 


Following the second equals sign, the transfer matriz, defined by 
Trot = ekert3(ote') | (F.3) 
was introduced. Its matrix representation is given by 
K+h ,-K 
e e€ 
T= ("1 cen)» (F.4) 


One readily finds the two eigenvalues of this (2 x 2) matrix: 


Ai,2 = e* cosh h + (exe + e°* sinh*h) Ve , 


(F.5) 


The trace in (F.2) is invariant under orthogonal transformations. By trans- 
forming to the basis in which T is diagonal, one can verify that 


Zn =X ES (F.6) 
The free energy per spin is given by the logarithm of the partition function 
1 
=— im — é F. 
f(T, A) kT lim 4 los 2y (F.7) 
In the thermodynamic limit, N — oo, the largest eigenvalue dominates: 
f =—kTlogd, (due to 1 > re for all T > 0). (F.8) 


There is no phase transition in one dimension, since 
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f(T, 0) = —kT [log 2 + log(coshZJ)] (F.9) 


is a smooth function for T > 0. Owing to the short range of the interactions, 
disordered spin configurations (with high entropy S) are more probable than 
ordered configurations (with low internal energy F) in equilibrium (where 
F(T,0) = E—TS is a minimum). The isothermal susceptibility x = (3%), 
for H = 0 is found from (F.8) to be x = Be?97 at low T. There is a pseudo- 


phase transition at T = 0: m3 = 1, yo = 0. 


Magnetization: m = —Of/0B Specific Heat: Cy = —T0? f /OT? 


Cu=0 


J/k T 


Fig. F.1. The magnetization and specific heat in the one-dimensional Ising model 


The spin correlation function can also be expressed using the transfer 
matrix and computed: 


1 
(nol) Nn => S- e FX oo; 


1 Toy on Tk opens «++ Loy_101 


I 
| 
LM 
a 
3 


{oi=+1} 
Ca 1 Ue mn (F.10) 
1 
=F. (x+| Daee ike t Ix+) 
ZnS 
-1 l-k N-I+k _f10 
=Zy Tr (t,T° “7,7 ), z= 0-1)? l>k. 
To distinguish them from o; = +1, the Pauli matrices are denoted here by 


Tx,y,z- Lhe trace in the last line of (F.10) refers to the sum of the two diagonal 
matrix elements in the Pauli spinor states y+. Further evaluation is carried 
out by diagonalizing T: 


ie ) 4 , where r==(2 i, 


0 A» -11 
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With 


01 
-1_ ores 
a cs (10) = Te 


it follows from (F.10) that 
(or01) y = Zig Mt eel ye mez 


and thus for! —k < N in the thermodynamic limit N — oo we obtain the 
final result 


(ox01) = (2) ” (F.11) 


For T > 0, Ao < Ai, ie. (o~01) decreases with increasing distance | — k. For 
T — 0, A1 > Ae (asymptotic degeneracy), so that the correlation length 
E— ©. 

By means of the transfer matrix method, the one-dimensional Ising model 
is mapped onto a zero-dimensional quantum system (one single spin). The 
two-dimensional Ising model is mapped onto a one-dimensional quantum sys- 
tem. Since it is possible to diagonalize the Hamiltonian of the latter, the 
two-dimensional Ising model can in this way be solved exactly. 


G. Integrals Containing the Maxwell Distribution 


[ee (AE) Pw =n(san)”” ( sum) [eve 


m/2kT 
HteT\e 2 pues = 1,2, 
2 
fe SF j= SnkT (G.1d) 
2 
je () fo(v) = Pale)? (G.1e) 
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kT 
[ee URViV; UF? (V) = A(Seidjt + On; Si + uVdiz) » A= a. (G.1g) 


Eq. (G.1g) can be demonstrated by first noting that the result necessarily 
has the form given and then taking the sum }?,_; )/j;—): comparison with 
(G.1e) leads using f d?v (v2)? f°(v) = 15, to the result A= ©. 


H. Hydrodynamics 


In the appendix, we consider the microscopic derivation of the linear hydro- 
dynamic equations. The hydrodynamic equations determine the behavior of 
a system at low frequencies or over long times. They are therefore the equa- 
tions of motion of the conserved quantities and of variables which are related 
to a broken continuous symmetry. Nonconserved quantities relax quickly to 
their local equilibrium values determined by the conserved quantities. The 
conserved quantities (energy, density, magnetization...) can exhibit a time 
variation only by flowing from one spatial region to another. This means 
that the equations of motion of conserved quantities E(x) typically have the 
form E(x) = —Vjp(x). The gradient which occurs here already indicates 
that the characteristic rate (frequency, decay rate) for the conserved quan- 
tities is proportional to the wavenumber gq. Since jg can be proportional to 
conserved quantities or to gradients of conserved quantities, hydrodynamic 
variables exhibit a characteristic rate ~ q”, i.e. a power of the wavenumber q, 
where in general « = 1,2. In the case of a broken continuous symmetry there 
are additional hydrodynamic variables. Thus, in an isotropic antiferromag- 
net, the alternating (staggered) magnetization N is not conserved. In the 
ordered phase, its average value is finite and may be oriented in an arbi- 
trary direction in space. Therefore, it costs no energy to rotate the staggered 
magnetization. This means that microscopic variables which represent fluc- 
tuations transverse to the staggered magnetization likewise belong to the set 
of hydrodynamic variables, Fig. H.1. 


E(x) 


a i 


(a) Fig. H.1. Conserved 
quantities and broken- 


N@) symmetry variables: (a) 
the energy density E(x) 
and (b) the alternating 
(staggered) magnetization 
N 

(b) 


ey 


(x) in an isotropic or 
planar antiferromagnet 
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H.1 Hydrodynamic Equations, Phenomenological Discussion 


In order to gain some insight into the structure of the hydrodynamic equa- 
tions, we first want to consider a simple example: the hydrodynamics of a 
ferromagnet, for which only the magnetization density is conserved. The mag- 
netization density M(x) obeys the equation of continuity 


M(x) = —Vjm(x) - (H.1) 


Here, ji is the magnetization current density. This becomes larger the greater 
the difference between magnetic fields at different positions in the material. 
From this fact, we obtain the phenomenological relation 


Jau(x) = —AVHA(x) , (H.2) 
where X is the magnetization conductivity. The local magnetic field depends 
on the magnetization density via the relation 

1 
H(x) = —M(x), (H.3) 
x 


in which the magnetic susceptibility . enters. Inserting (H.3) into (H.2) and 
the latter into (H.1), one finds the diffusion equation 


M(x,t) = DV?M(x,t) , (H.4) 
where the magnetization diffusion constant is defined by 
r 
of 


D= 


To solve (H.4), it is expedient to apply a spatial Fourier transform; then the 
diffusion equation (H.4) takes on the form 


Mg =—D@Mgq,; (H.5) 
with the obvious result 
Mg(t) = e724 *M,(0) . (H.6) 


The diffusive relaxation rate Dq? decreases as the wavenumber becomes 
smaller. For several variables X§, whose deviations from equilibrium are de- 
noted by 6 (XS) the hydrodynamic equations have the general form 


o 5(X%) + M@'(q) 5(X2) =0. (H.7) 


Here, M ce’ (q) is a matrix which vanishes as q > 0. For the hydrodynamics 
of liquids, we recall Eq. (9.4.46a-c). 
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H.2 The Kubo Relaxation Function 


In linear response theory, one investigates the effects of an external force 
F(t) which couples to the operator B. The Hamiltonian then contains the 
additional term 


H'(t)=—F()B. (H.8) 


For the change of the expectation value of an operator A with respect to its 
equilibrium value, one obtains to first order in F(t) 


say) = [ ” dxaplt— UE) (H.9) 


with the dynamic susceptibility 


xaa(t—t') = -O(t— tA), BO) | (1.10) 
Its Fourier transform reads 
xaB(w) = i diety ap (t) . (H.11) 


We now consider a perturbation which is slowly switched on and then 
again switched off at the time t = 0: F(t) = e“O(—t)F. One then finds from 
(H.9) 


5(A(t)) = / * atx an(t —t)FO(—t")e 


—oco 


a / duy ap(u) Fe“) , (H.12) 
t 


where the substitution t — t/ = u has been employed. The decay of the 
perturbation for t > 0 is thus described by 


5(A(t)) = dan(t)Fe , (H.13) 


where! the Kubo relaxation function ¢ap(t) is defined by 


baal = 5 [ dlA), BOE“ (H.14) 


t 


Its half-range Fourier transform is given by 


ban(w) = i: * die ban(t). (1.15) 


4 QM IL, Sect. 4.3 
' The factor e“ is of no importance in (H.13), since dag(t) relaxes faster. 
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The Kubo relaxation function has the following properties: 


dap(t = 0) = vyap(w =0), (H.16) 
oan(t)=—xan(t) fort >0, (H.17) 
daB(w) = = (xaB(w) ~ xan(v = 0) : (H.18) 


Eq. (H.16) follows from the comparison with the Fourier transformed 
dynamical susceptibility, Eq. (H.11). The second relation is obtained imme- 
diately by taking the derivative of (H.14). The third relation can be obtained 
by half-range Fourier transformation of (H.17) 


“ | dtey49(t) = | dtob an(t) 
0 0 


= eb aa(é)|p — iw | dte*dan(t) = dan(t = 0) —iwbanw) 
0 


and application of d4B(t = co) = 0, (H.16) and 


i dt ety 4p(t) = F dte'y p(t) = XaB(w) . 
0 —0oo 


Further, one can show for t > 0 that 


, i 


ban) = [a's (AC), BO] = -F([A@, BO)]) = —xanl) 


bapw) = —xXaBlw) (H.19) 
and, together with (H.18), 

whaB(w) =idjp(w) +ixap(w =0). (H.20) 
Later, we will also require the identity 


i f* . i 
xine == 5 f° at((AW), BO']) = -F([A@), BO)']) , G20 
0 
which follows from the Fourier transform of (H.10) and the fact that the 
expectation value ([A(oo), B(0)']) vanishes. 
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H.3 The Microscopic Derivation of the Hydrodynamic 
Equations 


H.31 Hydrodynamic Equations and Relaxation 


We introduce the following notation here: 


X" (x,t) i=1,2,... densities (Hermitian) 
; 1 / . , 

Xi(t) = —= | Pre 'i*X'*(x,t Fourier transforms H.22a 
q(t) W (x, ¢) ( ) 
a 1 iqx yi t t 

X?(x,t) = Wi STE XL@) aoe (H.22b) 

q 
x(a, t) = Xx: xi (O) ete. 


Conserved densities are denoted by indices c,c’,... etc. and nonconserved 
densities by n,n’,.... 

We now consider a perturbation which acts on the conserved densities. 
At t = 0, it is switched off, so that the perturbation Hamiltonian takes on 
the form 

H!=- / daX°(x,t)K°(x)O(-t)e* = —— X°,(t)K5O(-te* 

a 


and leads according to Eq. (H.13) to the following changes in the conserved 
quantities for t > 0: 


5(XE(t)) = 6° (a, t)KGe" (H.23) 
The decay of the perturbation is determined by the relaxation function. 

The situation considered here is, on the other hand, also described by the 
hydrodynamic equations (H.7) 
eh) = 0s (H.24a) 
If we insert Eq. (H.23) into (H.24a), we obtain 

co! O ce! eel! cl 

{oS + Me'(a) bo" (GK Gg" = 0. 

Since this equation is valid for arbitrary K’ ce it follows that 


, O , Tid P 
{oe tM (a) }o° e"(g,t) =0. (H.24b) 


From this, we find ¢°°" (q,w) by taking the half-range Fourier transform 
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a : , 0 , aw 
| delet se — 4. Me (abo (q,t) =0 
0 ot 
and carrying out an integration by parts, i.e. 
— 9°" (q,t = 0) + { — wd" + M*(q) }4°"C" (G,w) = 0. 


Using (H.16), we obtain finally 


” 


{ —iwa’ + u"(a) bo?" (a,w) = x" (a). (H.24c) 


Therefore, the relaxation functions °° (q,w) and thus the dynamic response 
functions are obtained from hydrodynamics for small q and w. 

If, conversely, we can determine oe (q,w) for small q and w from a micro- 
scopic theory, we can then read off the hydrodynamic equations by comparing 
with (H.24a) and (H.24c). 

We consider an arbitrary many-body system (liquid, ferromagnet, antifer- 
romagnet, etc.) and divide up the complete set of operators X, f into conserved 
quantities Xg and nonconserved quantities Xg. Our strategy is to find equa- 
tions of motion for the X4, where the forces are decomposed into a part which 
is proportional to the Xg and a part which is proportional to the Xg. The 
latter fluctuate rapidly and will lead after its elimination to damping terms 
in the equations of motion, which then contain only the Xq. 

In order to visualize this decomposition in a clear way, it is expedient to 
choose the operators in such a manner that they are orthogonal. To do this, 
we must first define the scalar product of two operators A and B: 


(A|B) = Xa,pi(w = 0). (H.25) 


Remark: one can readily convince oneself that this definition fulfills the properties 
of a scalar product: 
(A|B)* = (BIA) 
(ce, A1 oe C2 A2|B) = Cj. (Ai|B) + C2 (A2|B) 
(A|A) is real and (A|A) > 0 (0 only for A=0) . 


We now choose our operators to be orthonormalized: 
(X§|X2) = x7 (a, =0) = 6 . (H.26) 


To construct these operators, one uses the Schmidt orthonormalization pro- 
cedure. 

The Heisenberg equations of motion X. a= ;[H, XQ] etc. can now be written 
in the form 


Xo = -i0% (q)XE -iC(q)X" (H.27a) 
X® = -i0"*(q)X¢ — iD" (q)X® . (H.27b) 


554 Appendix 


Here, the derivatives xe and Xe were projected onto x and Die If we 
take e.g. the scalar product of xe with XE: we find using Eq. (H.21) 


(XE xe) = ~iC (q) = —lpxe, xt) 


A 
That is: 
cc! 1 c c 
and analogously 
cn 1 c n nn’ 1 n n’ 


These coefficients obey the following symmetry relations: 
cr"*(q) = CF4(q) , C*(q) = CQ) , D”*(q) = DQ). (29) 


It thus follows from (H.20) that 


wo (q,w) = C%" (q)¢* * (q,w) + C°"(q)o* (a, w) + 15° 

Oe Ge) = (a)o" “(a,w) + D™™ (q)¢”“(a,w) (30) 
wo" (q,w) = C°(q)$ (q,w) + D"™ (q)o""™ (q,w) +16" 
wo"(q,w) = C% (q)o°"(q,w) + 0% (q)o" (a, w) . 


From (H.30b) we read off the result 
o"*(4,w) = (WI ~ DEA))nw CO” * (a) 6° (a, w) : (H.31) 
then inserting (H.31) into (H.30a) leads to 


1 


. 5°" — O°" (q) — 0"(q) (a5 


) era] 0° (au) 
= is". (H.32) 


For the conserved quantities, the coefficients C°“ (q) and C°"(q) vanish in 
the limit q — 0. Therefore, in the limit of small q, we find 


i (C™' (a)Xq + C™(a) XG) = igasé(a) . (H.33) 
We define also the nonconserved part of the current density 


C(q)X? = gaJi(a) - (H.34) 


In contrast to (H.33) and (H.34), the Dn» (q) remain finite in the limit q — 0. 
For the behavior at long wavelengths (q — 0), we can therefore take > 
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in the limit w,q — 0, whereby due to the finiteness of D(q), we can expect 
that 


1 1 
iyi ae, te ee ee H. 
w0q—0 wil — D(q)  a7000 wil — D(q) eee 


In the limit q — 0, we can find a relation between 5 from Eq. (H.30c) 


and a correlation function. Owing to limgoC”°(q) = 0, it follows from 
(H.30c), with the abbreviation D = limg-.o D(q), that 


(wil — D) lim ¢” ™ (q,w) = id" 


or 


1 
(=) same” ‘(q,w) . (H.36) 


Inserting this into (H.32) and taking the the double limits, we obtain 


(ws - c°'(q) +iC°"(q ) (Jim lim gn ‘(q ,w) ore (a) ) 9°" (q,w) 


w—0q—0 
= jgce" 
ie. finally: 
(ws — 0% (a) + igaaal es ) °°" (as) = id" (H.37) 
with the damping coefficients 
= = lim lim Pjc jg (4) ; (H.38) 


w—0 q—0 


Here, the sums over n and n’ were combined into the nonconserved current 
densities defined in Eq. (H.34). 

When the system exhibits symmetry under reflection, rotation, etc., the 
number of nonvanishing coefficients I" fe can be reduced. We assume that for 


the remaining functions Pje ejg the operators j, and j~ have the same sig- 


nature’® under time reversal: €;, = €;,,. Applying (H.18) and the dispersion 


relations!’, one obtains 
. i 
a GS = x" (w) 
ow) = ->(x"(e) — x) + = 
1 / iN 
__ip »_X"(w') x"(w) W 
T | (w! — w)w us p. (Ee) 
Spe > (w) 


16 QM I, Sect. 4.8.2.2 
1” QM I, Sect. 4.4 
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which, owing to the fluctuation-dissipation theorem!® and the antisymmetry 
of x'"(w), finally leads to limo ¢(w) = limo 3G? (w) and 


ec’ i 7 ‘ * iwt /=c ~¢! 
ob = pp dim, lim f  dte™"Geq(t)35-q(0)) - (H.40) 


Co 


This is the Kubo formula for the transport coefficients, expressed in terms 
of current-current correlation functions. Without taking up their straightfor- 
ward proofs, we mention the following symmetry properties: 


/ y 
cece _ pec 
ap = [3a , 


PESO, 1G Se teal, (H.41) 


In summary, one can read off the following linear hydrodynamic equations by 
comparison with Eqns. (H.24c) and (H.24a): 


fa) , yi , , 

Hoe" + 10% (a) + anaes | (XG (0) = 9, (H.42a) 
cc! 1 Cc c! 
Cc (a) = 5 (XG XE q)) | (H.42b) 
r(q) = seer lich dt el ({ 72 (t),9° q(0)}) - (H.42c) 


4kT w-0q>0 J_y, 


The elements of the frequency matrix C°(q) ~ q (or q?) are functions of 
expectation values of the conserved quantities and the order parameters and 
susceptibilities of these quantities. They determine the periodic, reversible 
behavior of the dynamics. For example, for a ferromagnet, the spinwave 


frequency (H.42b)follows from w(q) = wr «x q*, where M is the magneti- 


zation and xq x q~? the transverse susceptibility. The damping terms re- 
sult from the elimination of the nonconserved degrees of freedom. They can 
be expressed via Kubo formulas for the current densities. For the deriva- 
tion it was important that the nonconserved quantities have a much shorter 
time scale than the conserved quantities, which also permits taking the limit 
lim,,.9 limg_.9. We note the similarity of this procedure to that used in the 
case of the linearized Boltzmann equation (Sect. 9.4). The present deriva- 
tion is more general, since no constraints were placed on the density or the 
strength of the interactions of the many-body system. 
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I. Units and Tables 


In this Appendix we give the definitions of units and constants which are used in 
connection with thermodynamics. We also refer to the Table on page 562. 


Conversion Factors 


LleV = 1.60219 x 10-8 J 
1N = 10° dyn 
1J = 1x10" erg 
1G = 2.997925 x 10° esu = 2.997925 x 10° \/dyn cm? 
1K 4. 0.86171 x 107* eV 
leV 4. 2.4180 x 10'4* Hz A 1.2399 x 1074 cm 
1T = 104 Gauss (G) 
1A = 10cm 
lsec = 1s 
Pressure 


1 bar = 10°dyn/em? = 10°N/m? = 10°Pa 
1 Torr = 1 mm Hg 
Physical Atmosphere: 
1 atm = air pressure at 760 mm Hg = 760 Torr = 1.01325 bar 


This relation between Torr and bar follows from the mass density of mercury pug = 
13.5951g cm~? at 1°C and the acceleration of gravity g = 9.80655 x 10?cm s~”. 
Technical Atmosphere: 


1 at = 1 kp/cm? = 0.980655 bar 


Temperature 


The absolute temperature scale was defined in Sect. 3.4 using JT; = 273.16 K, the 
triple point of H2O. 

The zero point of the Celsius scale 0°C lies at 273.15 K. Thus in this scale, 
absolute zero is at —273.15°C. With this definition, the equilibrium temperature of 
ice and water saturated with air under a pressure of 760 mm Hg = 1 atm is equal 
to 0°C. 


Table I.1. Fixed points of the international temperature scale: 


0°C ice point of water 
100°C equilibrium temperature of water and water vapor 
—182.970°C boiling point of oxygen 
444.600°C boiling point of sulfur 
960.8°C _ solidification point of silver 
1063.0°C _ solidification point of gold 
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For a comparative characterization of materials, their properties are quoted at 
standard temperatures and pressures. In the physics literature, these are 0°C and 
1 atm, and in the technical literature, they are 20°C and 1 at. 

Physical Standard State = standard pressure (1 atm) and standard temperature 
(0°C). 

Technical Standard State = 1 at and 20°C. 

Density of Hz at T; and P = 1 atm: 


p = 8.989 x 10° *g/Liter = 8.989 x 10°-°g cm? . 
Molar volume under these conditions: 


2.016 g 


V; ens. 
“~ 8.989 x 10-2 g Liter 


= 22.414 Liter ( 22.414 ) 
mole 


1 mole 4 atomic weight in g (e.g. one mole of Hz corresponds to a mass of 2.016 g). 


PV _ latm Vy 
NT L x 273.16 K 


Loschmidt’s number = Avogadro’s number: 


k= = 1.38065 x 10° 'erg/K . 


L = Na = number of molecules per mole 
2.016g _ 2.016 
mass Hp 2 x 1.6734 x 10724 


= 6.02213 x 107°. 


Energy 
The unit calorie (cal) is defined by 


1 cal = 4.1840 x 10° erg = 4.1840 Joule . 


A kilocalorie is denoted by Cal (large calorie). With the previous definition, 1 Cal 
up to the fourth place past the decimal point has the meaning 


1 Cal = 1 kcal = 1000 cal 


= the quantity of heat which is required to warm 1 kg H2O at 1 atm from 14.5 to 
ieee Oa 


1J=1Nm=10" dyn cm = 10° erg. 


Power 


1W=1VA=1Js'=10’ergs 
1HP=75 kpms =75 x 9.80665 x 10°dyn m s-* = 735.498 W. 

The universal gas constant R is defined via Loschmidt’s/Avogadro’s number by 
R= Nak = 8.3145 x 10"erg mol"'K~" . 


Using the gas constant R, one can write the equation of state of the ideal gas in 
the form 


PV =nRT , (1.1) 
where n is the amount of matter in moles (mole number). 


19 Note that the nutritional values of foods are quoted either in kJ or in kilocalories. 
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We close this section with some numerical values of thermodynamic quantities. 
Table 1.2, below, gives values of specific heats (Cp). 

As can be seen, the specific heat of water is particularly large. This fact plays 
an important role in the thermal balance of Nature. Water must take up or release 
a large quantity of heat in order to change its temperature noticeably. Therefore, 
the water of the oceans remains cool for a relatively long time in Spring and warm 
for a relatively long time in Autumn. It therefore acts in coastal regions to reduce 
the annual temperature fluctuations. This is an essential reason for the typical 
difference between a coastal climate and a continental climate. 


Table I.2. The Specific Heat of Some Materials under Standard Conditions 


Specific heat C Molecular Molar 
weight heat capacity 
[cal K7* g7*] [cal K~' mole~*] 
Aluminum 0.214 27.1 5.80 
Tron 0.111 55.84 6.29 
Nickel 0.106 58.68 6.22 
Copper 0.091 63.57 5.78 
Silver 0.055 107.88 5.93 
Antimony 0.050 120.2 6.00 
Platinum 0.032 195.2 6.25 
Gold 0.031 197.2 6.12 
Lead 0.031 207.2 6.42 
Glass 0.19 — — 
Quartz Glass 0.174 — a 
Diamond 0.12 = a 
Water 1.00 = — 
Ethanol 0.58 — — 
Carbon Disulfide 0.24 cs — 


Table 1.3. Expansion Coefficients of Some Solid and Liquid Materials in K~* 


linear volume 
Lead 0.0000292 Diamond 0.0000013 Ethanol 0.0011 
Tron 120 Graphite 080 Ether 163 
Copper 165 Glass 081 Mercury 018 
Platinum 090 Quartz Crystal | axis 144 Water 018 
Invar (°*Fe+3°Ni) 016 Quartz Crystal || axis 080 
Quartz Glass 005 


The linear expansion coefficient a; is related to the volume or cubic expansion 
coefficient in (3.2.4) via 


a= 3a1 a 
This follows for a rectangular prism from V + AV = (a+ Aa)(b+ Ab)(c + Ac) = 


abe (1+ 44 + 424 4°) + O(A’), thus 4¥ = 344% under the assumption of 
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isotropic thermal expansion as found in isotropic materials (liquids, amorphous 
substances) and cubic crystals. 


Table I.4. Some Data for Gases: Boiling Point (at 760 Torr), Critical Temperature, 
Coefficients in the van der Waals Equation, Inversion Temperature 


rere Boiling point T.{K] a [atm cm") b ES Time = 227K] 


in K mole” mole 4 
He 4.22 5.19  0.0335x 10° 23.5 35 
He 20.4 33.2 0.246 x10° 26.7 224 
No 77.3 126.0 1.345 x10° 38.6 850 
Oz 90.1 154.3 1.36 ~x10° 31.9 1040 
CO2 194.7 304.1 36 x10° 42.7 2050 
Table I.5. Pressure Dependence Table 1.6. Heats of Vaporiza- 
of the Boiling Point of Water tion of Some Materials in cal-g~! 
Pressure Boiling Point Ethyl Alcohol 202 
in Torr in °C Ammonia 321 
720 98.49 Ether 80 
730 98.89 Chlorine, Cle 62 
740 99.26 Mercury 68 
750 99.63 Oxygen, O2 51 
760 100.00 Nitrogen, N2 48 
770 100.37 Carbon Disulfide 85 
780 100.73 Water 539.2 
790 101.09 Hydrogen, Hz 110 
800 101.44 


Table I.7. Heats of Melting of Some Materials in cal- g~* 


Aluminum 94 Silver 26.0 
Lead 5.5 Table Salt 124 
Gold 15.9 Water (Ice) 79.5 


Copper 49 
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Table 1.8. Vapor Pressure of Table 1.9. Vapor Pressure of Iodine in 
Water (Ice) in Torr Torr 

—60°C 0.007 —48.3°C 0.00005 

—40°C 0.093 —32.3°C 0.00052 

—20°C 0.77 —20.9°C 0.0025 

+0°C 4.6 0°c 0.029 

+20°C 17.5 15°C 0.131 

+40°C 55.3 30°C 0.469 

+60°C 149.4 80°C 15.9 

+80°C 355.1 114.5°C — 90.0 (melting point) 


+100°C 760.0 185.3°C 760.0 (boiling point) 
+200°C = 11665,0 


Table 1.10. Freezing Mixtures and Other Eutectics 


Constituents Eutectic 

with Melting Points Temperature in °C Concentration 
NH,Cl Ice (0) -15.4 
NaCl Ice (0) -21 29/71 NaCl 
Alcohol Ice (0) -30 
CaCl2:6H20 Ice (0) -55 
Alcohol CO2(-56) -72 
Ether CO2(-56) -77 
Sn (232) Pb (327) 183 74/26 
Au (1063) Si (1404) 370 69/31 


Au (1063) Tl (850) 131 27/73 
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Subject Index 


absolute temperature, 91 
absolute zero, 513 

acoustic resonances, 466 
activation energy, 425 

adiabatic change, 515 

adiabatic equation, 102 

— of the ideal quantum gas, 174 
allotropy, 520 

ammonia synthesis, 155 
amplitude ratios, 372 
anharmonic effects, 211 
antiferromagnet, 336, 337, 548 
antiferromagnetism, 287-288 
Arrhenius law, 425 

atmosphere, 167 

average value, 65 
average-potential approximation, 244 
Avogadro’s number, 92, 558 


background radiation, 203, 508 

barometric pressure formula, 53, 72, 
414 

barrier, see reaction rates 

BBGKyY hierarchy, 442 

Bernoulli numbers, 228, 537 

Bethe lattice, 393-398 

— percolation threshold, 394 

Bethe—Peierls approximation, 327 

binary alloys, 517 

black body, 203 

black holes, 188, 508 

black-body radiation, 198, 203 

Bloch equations, 429 

— in the ferromagnetic phase, 435 

block-spin transformation, 359, 360 

Bohr magneton, 269 

Bohr—van Leeuwen theorem, 276 

boiling boundary, 133 

boiling point 

— numerical values for some materials, 
560 


boiling-point elevation, 263, 265 

Boltzmann constant, 36, 92 

— experimental determination of, 92, 
414 

Boltzmann equation, 437-475 

— and irreversibility, 443, 445, 505 

derivation of, 438-443 

linearized, 455-468 

— symmetry properties of, 440, 444, 
476 

Boltzmann’s entropy, 480, 498-499 

— in the urn model, 512 

bond percolation, 389 

Bose distribution function, 171 

Bose-Einstein condensation, 190-197, 
223 

Bose-Einstein statistics, 170 

bosons, 170 

— second virial coefficient, 532 

Bravais crystal, 208 

Brillouin function, 281 

Brownian motion, 409-410 

— in a force field, 414 

— in the limit of strong damping, 414 

— microscopic model of, 484-490, 510 

— of a sphere in a liquid, 414 

bubble point line, 165 

Buckingham potential, 240 

bulk viscosity, 465, 477 


| 
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canonical momentum, 269 
canonical variables, 80 
Carnot cycle, 126, 162 

— efficiency of, 127 

— inverse, 127 

catalyst, 154 

cavity radiation, 198, 203 
Cayley tree, see Bethe lattice 
central limit theorem, 7, 9 
characteristic function, 5 
chemical constants, 152, 235 
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chemical potential, 45, 64 

— of a photon gas, 205 

— of an ideal Bose gas, 195 

— of an ideal Fermi gas, 182 

chemical reactions, 150-155, 422-425 

— rates of , see reaction rates 

circuit, electrical, 434 

classical limit of quantum statistics, 
521-526 

Clausius principle, 105 

Clausius’ equation of state, 247 

Clausius—Clapeyron equation, 134-139, 
163, 323 

cluster, 388 

cluster number, 391 

cluster radius, 399 

coefficient of expansion (thermal), 84, 
213 

coefficients 

— stoichiometric, 150 

coexistence curve 

— in the van der Waals theory, 253, 
256 

coexistence region, 131, 298, 376 

— in the van der Waals theory, 250 

coexistence singularities, 376 

collective degrees of freedom, 425 

collective density excitations, 215 

collision duration, 437 

collision operator 

— linear, 457 

—-— eigenfunctions of, 458, 468 

collision term, 439, 468, 478 

— linearized, 456 

collision time, 410, 437, 462, 476 

collisional invariants, 448-449, 457, 458 

components, 130 

compressibility, 83, 88 

— adiabatic, 83, 454 

— and particle-number fluctuations, 90 

— in the van der Waals theory, 254 

— isentropic, 83 

— isothermal, 83, 88, 90, 467 

—— at absolute zero, 515 

compression 

— adiabatic, 126 

— isothermal, 126 

compressional viscosity, see bulk 
viscosity, 477 

concentration, 257 

condensation boundary, 133 

conditions for equilibrium, 122 


configuration-space transformations, 
346 

conservation 

— of energy, 450 

— of momentum, 450 

— of particle number, 450 

conserved quantities, 26, 553 

— in the Boltzmann equation, 447-451 

constraints, 97, 105, 494 

continuous symmetry, 373 

continuum percolation, 388 

cooling efficiency, 128 

correlation function, 300-301, 306-307, 

367, 370 

longitudinal, 371, 375, 377, 407, 408 

Ornstein—Zernike, 303 

radial, 393 

— transverse, 371-374 

correlation length, 302, 307, 339, 367, 
390, 399 

— critical exponent of, 401 

correlation time, 410 

correlations, 6, 304 

corresponding states, law of, 251 

Coulomb interaction, 184 

coupling coefficients, see coupling 
constants 

coupling constants, 345, 346, 348, 
351 

critical dimension, 373, 385 

critical dynamics, 425-429, 468 

critical exponent 

— dynamic, see dynamic critical 
exponent 

critical exponents, 299, 334, 340, 354 

— correlation length, 339 

— dynamic exponents, 391 

— for the specific heat, 334 

— logarithmic divergence, 335 

— of a ferromagnet, 336 

— of a liquid, 336 

— of the correlation function, 343 

— of the van der Waals theory, 255 

— scaling laws, 341, 344 

—— hyperscaling relation, 344 

scaling relations, 343 

tables, 255, 336, 387 

critical isotherm, 336 

— in the van der Waals theory, 254 

critical opalescence, 255, 304, 343, 468 

critical phenomena, 331 

critical point, 132, 332, 334, 426 

— in the van der Waals theory, 251-257 
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critical slowing down, 428 

critical temperature 

— numerical values for some materials, 
560 

crossover, 386, 405 

cumulant, 8, 244, 377 

Curie law, 282 

Curie temperature, 287, 292 

Curie-Weiss law, 294 

current density, 444 

cyclic process, 107, 125-130 

— Carnot, 126, 161 

Diesel, 162 

general, 128 

Joule, 162 

Stirling, 163 
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damping, 414, 461, 489 

— see also friction 

— by a bath of harmonic oscillators, 
484, 510 

damping coefficient (hydrodynamic), 
555 

damping term, 464 

de Haas—van Alphen oscillation, 287 

Debye approximation, 210 

Debye frequency, 211 

Debye’s law, 210 

decimation procedure, 349 

decimation transformation, 346, 356, 
359, 400 

degree of dissociation, 153 

degree of polymerization, 320 

demagnetizing factor, 308 

demagnetizing field, 308 

demixing transition 

— binary liquids, 337 

density 

— of normal fluid, 220 

— of states, 37 

— superfluid, 220 

density matrix, 14, 35 

— canonical, 51 

—— magnetic, 271 

— grand canonical, 64 

— in the molecular field approximation, 
291 

— microcanonical, 29 

density of states, 183, 209, 211, 285, 
287 

— of free electrons, 183 

— of phonons, 209, 211 

density operator, see density matrix 


Subject Index 567 


density-density correlation function, 
467, 477 

deviation 

— of particle number, 90 

deviation, relative, 9, 41, 42 

dew formation, 160 

dew point line, 165 

diamagnetism, 278-279 

Diesel cycle, 162 

diffusion constant, 413 

diffusion equation, 413, 421, 433, 549 

— for temperature, 463 

dilute solutions, 257-266 

— chemical potential of, 258 

— free enthalpy of, 260 

— pressure, 257 

dipole interaction, 278, 307-317 

direction, 156 

discrete symmetry, 338 

dissipative systems, 387 

distortive transition, 336, 337 

distribution 

— binomial, 17 

— Poisson, 17 

distribution function, 9, 11 

— canonical, 51 

— derivation from the density matrix, 
521 

— grand canonical, 65 

— microcanonical, 27 

domains, 298, 316-317, 404 

dual lattice, 400 

duality transformation, 400 

Dulong—Petit law, 210 

dynamic critical exponent, 428 

dynamic susceptibility, 550 


effective exponents, 386 

effective mass, 186 

efficiency, 129, 163 

Ehrenfest’s classification, 332 
Einstein relation, 411—412 

elastic transition, 337 

electron gas 

— in solids, 185, 531 

electronic energy, 227, 230 
empirical temperature, 91 
endothermic reaction, 156 

energy 

— canonical free, 271 

— free, see free energy 

— internal, 75, 275 

— rotational, see rotational energy 
translational, see translational energy 
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— units, 558 
— vibrational, see vibrational energy 
energy conservation, 441 
energy density, 447 
— equation of motion for, 461 
— spectral, 201 
energy levels, spacing of, 3, 37 
energy shell, 26 
— fora spin-3 paramagnet, 34 
— for Glassigal ideal gas, 30 
— of harmonic oscillators, 33 
— surface area? (F), 27 
— volume, 27 
— volume inside, 29 
ensemble, 3, 9 
— canonical, 50-63 
— grand canonical, 63-68 
— microcanonical, 26-30 
—— magnets, 271 
— mixed, 15 
— pure, 14 
— table of, 67 
ensemble average, 497 
enthalpy, 77 
— free, see free enthalpy 
entropy, 35, 59 
— additivity of, 60 
— and Nernst’s theorem, 513-521 
— Boltzmann, see Boltzmann’s entropy 
— canonical, 54 
— extremal property of, 36 
— Gibbs, see Gibbs’ entropy 
— grand canonical, 65 
— in sound propagation, 454, 455 
— increase of, 105, 446, 493 
— maximum, 36, 37, 70, 121 
— microcanonical, 37 
— of a paramagnet, 283, 320 
— relation to H, 443, 476 
— residual entropy, 516-521 
entropy balance, 508 
entropy death, 507 
entropy flow, 444 
entropy of mixing, 115 
€-expansion, 382 
equation of continuity, 441, 449, 453, 
476 
— for the particle density, 413 
equation of state 


— in the molecular field approximation, 


295 
— magnetic, 293, 339, 340, 365 
— Mie-Griineisen, 212 


— of a molecular gas, 226, 234 

— of ideal gas, 47 

— van der Waals, 112 
equilibrium 

— chemical, 150 

— local, see local equilibrium 

— thermodynamic, 120, 150 
equilibrium conditions, 120-146 
equilibrium distribution function 
— local, 449 

equilibrium state, 26 

ergodic, 497 

ergodic theorem, 25, 497 

Euler’s equation, 453 
Euler—MacLaurin summation formula, 


228 
eutectic, 149 
— table, 561 


eutectic point, 148 

evaporation curve, 130, 132 

Ewald method, 314 

exact (perfect) differential, 85, 87 

exact differential, 86, 160 

exchange 

— direct, 288 

— indirect, 288 

exchange corrections, 176 

— to the second virial coefficient, 241, 
531 

exchange interaction, 287-289 

exothermic reaction, 156 

exp-6-potential, 240 

expansion 

— adiabatic, 99, 126 

—— Third Law, 515 

— isothermal, 98, 126 

expansion coefficient (thermal), 88 

— at absolute zero, 515 

— linear, 559 

— numerical values for various 
materials, 559 

expansion of a gas 

— and irreversibility, 97, 500 

expectation value, see mean value 

exponents, critical, see critical 
exponents 

extremal properties, 120-122, 275 


Fermi 

— distribution function, 171, 179 
— energy, 177, 184, 221 

— gas, 176-185 

—— nearly degenerate, 176 
liquid, 186 
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—— Landau theory of, 188 

— momentum, 177 

— sphere, 177 

— temperature, 181, 182, 184 

Fermi—Dirac statistics, 170, 278 

fermions, 170, 278 

— interacting, 185 

ferrimagnet, 336 

ferroelectric, 336 

ferromagnet, 287-307, 333, 336, 361 

— isotropic, 337, 429 

— planar, 338 

— uniaxial, 337, 338 

ferromagnetism, 287-307 

fields 

— irrelevant, 355 

— relevant, 355 

fixed point, 345, 347, 352, 356, 384, 401 

flow diagram, 351 

flow term, 439, 456, 476 

fluctuation-dissipation theorem, 556 

fluctuation-response theorem, 90, 300, 
326 

fluctuations, 6, 312, 366 

— in Gaussian approximation, 406 

— of energy, 89-90 

— of particle numbers, 205, 494 

— time interval of large fluctuations, 
494-497 

Fokker—Planck equation 

— for a free particle, 416-418 

—— solution of, 420 

— for particles in a force field, 420 

— for stock-market prices, 436 

fractal dimension, 399, 402 

free energy, 59, 77, 274-275, 310, 311 

— canonical, 272 

— convexity of, 139 

— Helmholtz, 272 

free enthalpy, 145 

— concavity of, 139 

free enthalpy (Gibbs’ free energy), 78 

freezing mixture, 149 

— table, 561 

freezing-point curve, 148 

freezing-point depression, 137, 263 

frequency matrix, 556 

fugacity, 68, 172, 191, 237 

functional integration, 415 


gain processes, rate 
— in the collision term of the Boltzmann 
equation, 439, 473 
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Galilei transformation, 217 

Galton board, 22 

I space, 9 

gas 

— adiabatic expansion of, 95, 99 

— ideal, see ideal gas 

— ideal molecular gas, see ideal 
molecular gas 

— isothermal expansion of, 98 

— real, see real gas 

— reversible expansion of, 98 

gas constant, 558 

Gaussian approximation, 366, 372 

Gaussian distribution, 23 

Gaussian integral, 32 

Gay-Lussac experiment, 95, 494 

— irreversible, 95, 119 

— reversible, 98, 99 

Gibbs distribution, 64 

Gibbs free energy, 78 

Gibbs’ entropy, 479, 498 

— time independence of, 511 

Gibbs’ paradox, 27, 117, 526 

Gibbs’ phase rule, 146-150 

Gibbs—Duhem relation, 81, 145 

— differential, 81 

Gibbs-Duhem relation, 166 

Ginzburg-Landau approximation, 
364 

Ginzburg-Landau functionals, 361 

Ginzburg-Landau model 

— time-dependent, 427 

Ginzburg-Landau theory, 361, 404, 
538-544 

Ginzburg—Levanyuk temperature, 373 

Goldstone modes, 406 

Griineisen constant, 212 

grand canonical density matrix 

— in the second quantization, 69 

grand canonical potential, 146 

grand partition function, 64 

— of an ideal quantum gas 

—— in the second quantization, 172 

— of the ideal quantum gas, 170 

grand potential, 65, 78 

— of the ideal quantum gas, 169, 171 

gravitational instability, 508, 509 

growth processes, 387 

gyromagnetic ratio, 270 


H-theorem, 443-446, 480 
Hamiltonian, 11 

— of the dipole interaction, 307, 314 
— of the exchange interaction, 288 
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Hamiltonian equations of motion, 11 

Hamiltonian operator, 11 

hard core potential, 239 

harmonic oscillators, 9, 325, 433, 481, 
484 

— non-coupled quantum-mechanical, 
33, 48 

heat, 60, 61, 76 

— latent, see latent heat 

heat bath, 50 

heat capacity, 82, 88 

— see also specific heat 

— at absolute zero, 515 

heat conductivity, 433, 463, 477 

heat current density, 450, 463 

heat death, 507 

heat diffusion, 463, 466, 467 

heat input, 76 

heat of melting, 265 

— numerical values for some materials, 
560 

heat of reaction, 153, 156 

heat of vaporization 

— numerical values for some materials, 
560 

heat pump, 126 

heat transfer, 62 

heating 

— of a room, 117 

heating efficiency, 128 

Heisenberg model, 288, 328 

— anisotropic, ferromagnetic, 337 

— at absolute zero, 520 

— isotropic, 337, 338 

helical phases, 336 

*He, 137, 186, 187 

— melting curve, 323, 324 

— phase diagram of, 186 

“He, 187, 196, 213-221 

— phase diagram of, 196 

He I-He II transition, 337 

He II, 213-221 

— excitation spectrum of, 214 

— quasiparticles in, 213 

Helmholtz free energy, 77, 272, 296 

Hertzsprung—Russell diagram, 188 

high-temperature fixed point, 352 

Holstein—Primakoff transformation, 328 

homogeneous function, 397, 404 

Hubbard model, 224, 289 

Hubbard-Stratonovich transformation, 
407 

Hund’s rules, 278 


hydrodynamic equations, 460-468 

— microscopic derivation of, 490, 
552-556 

— phenomenological discussion of, 549 

— solution of, 466-468 

hydrodynamic limit, 460-466 

hydrodynamic variables, 460 

hydrodynamics, 425, 451, 548-556 

— of a ferromagnet, 549, 556 

hydrogen bonds, 406, 518 

hydrogen in metals, 262 

hydrogen-oxygen reaction, 153 

hyperscaling relation, 344 

hypersphere, 32 

hysteresis behavior 

— in a first-order phase transition, 405 


ice, 136, 517-519 

— regelation of, 137 

ideal gas, 39, 46-48, 558 

— caloric equation of state of, 46, 558 

— classical, 30, 67, 89, 437 

— thermal equation of state of, 47 

ideal gases 

— reactions of, 152 

ideal molecular gas, 225-236 

— chemical potential of, 226 

— free energy, 226 

— influence of the nuclear spin, 232-233 

internal energy of, 226 

— mixtures of, 234-236 

ideal quantum gas, 169-221 

— classial limit, 175 

— of free particles, 173 

integrability conditions, 84, 86 

internal combustion engine, 125 

internal degree of freedom, 225, 226 

internal field, 314 

inversion, 323 

inversion curve, 111, 112, 266 

irreversibility, 97, 443-446, 479-509 

— and time reversal, 500-509 

— and external perturbations, 503 

— from microscopic equations of motion 
in the limit of infinitely many degrees 
of freedom, 484 
in quantum mechanics, 491—494 

irreversible changes, see process 

irreversible process, see process 

Ising model, 289-304, 327 

— Ginzburg-Landau functional, 538 

— one-dimensional, 326, 346, 545-547 

— two-dimensional, 349, 547 

isobar, 47 
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isochore, 47 

isotherm, 47 

isotropic ferromagnet, 429 
isotropic Heisenberg model, 338 


Jacobians, 87 

Joule cycle, 162 
Joule-Thomson coefficient, 111 
Joule-Thomson process, 110 


kinetic momentum, 269 

kinetic theory, 437 

Kubo formula, 490, 556 

Kubo relaxation function, 490, 550-551 


lambda point, 195 

lambda transition, 337 

Landau diamagnetism, 286 

Landau quasiparticle interaction, 286 

Landau—Ginzburg—Wilson functional, 
380 

Landau-—Lifshitz equations, see Bloch 
equations 

Landau-—Placzek peak, 467 

Landau—Placzek ratio, 467 

Landé g-factor, 270, 280 

Langevin diamagnetism, 279 

Langevin equation, 409-416 

Langevin function, 72, 282, 319 

lasers, 322 

latent heat, 135, 520 

— see also heat of melting, heat of 
vaporization 

— in the van der Waals theory, 255 

lattice vibrations, 208 

— see also phonons 

lattice-gas model, 408 

law of mass action, 150-155, 163 

Law of Thermodynamics 

— First, 1, 44, 60, 66, 76, 104, 106, 107, 
145, 276 

—— for magnetic systems, 272 

— Second, 1, 76, 104, 106, 108 

— Third, 109, 513-521 

— Zeroth, 109 

Legendre transformations, 79 

Lennard—Jones potential, 238, 240 

Le Chatelier’s principle, 124 

limiting dimension, see critical 
dimension 

linear chain, 206, 481, 509 

linear response, 550 

Liouville equation, 11, 12, 511 
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liquid-gas transition, see liquid-vapor 
transition, 250, 333 

liquid-vapor transition, 130, 132 

local equilibrium, 448, 451 

local field, 311 

local temperature 

— equation of motion for, 462 

log-normal distribution, 23 

logarithmic corrections, 385 

long time tails, 442 

longitudinal susceptibility, see 
susceptibility 

Loschmidt’s number, 558 

Loschmidt’s paradox, 479, 500, 503 

loss processes, rate 

— in the collision term of the Boltzmann 
equation, 439, 472 

low-temperature fixed point, 352 

low-temperature physics, 138 


macrostate, 3, 9, 25, 498 

magnetic moment 

classical, 282 

— of a body, 272 

of the electron, 270 

total, 270 

magnetization, 272 

— as a hydrodynamic quantity, 548 

— in Pauli paramagnetism, 285 

— in the one-dimensional Ising model, 
546 

— spontaneous, 287, 293, 364 

magnetization fluctuations, 374 

magnetomechanical ratio, see gyromag- 
netic ratio 

magnons, 213, 328 

main sequence, 188 

Markov process, 422 

master equation and irreversibility 
in quantum mechanics, see Pauli’s 
master equation 

Maxwell construction, 249 

Maxwell distribution, 53, 438, 445, 456, 
476 

— integrals, 547 

— local, 448, 452, 476, 478 

Maxwell potential, 478 

Maxwell relations, 84, 273, 275 

mean field, 290, 298 

mean free path, 443 

mean square deviation, 5, 8, 41, 42 

mean value, 4, 5, 27, 51 

melting curve, 130 

melting-point depression, 265 
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metastable states, 516 

microstate, 3, 9, 25, 498 

Mie—Grtineisen equation of state, 212 

mixing, 497 

mixtures, 144, 152, 166, 234, 257-266 

mobility, 413 

molar heat capacity, 83 

molar volume, 92, 154, 558 

mole, 92, 558 

molecular chaos, 442 

molecular crystals (residual entropy), 
516 

molecular field approximation, 289-300 

molecular gas, 152 

moments, 4, 8 

momentum conservation, 441 

momentum current, 477 

momentum density, 447 

— equation of motion of, 465 

momentum-shell renormalization 
group, 380 

monomers, 317 

motion reversal, see time reversal 

Lt space, 438, 505 


N-particle distribution function, 438 

Néel temperature, 287 

Natterer tube, 333 

natural variables, 79 

Navier-Stokes equations, 451 

negative temperature, 320-322 

Nernst’s theorem, 109, 513-521 

neutron scattering, 304 

neutron stars, 187, 188 

noise 

— electrical, 434 

noise voltage, 434 

non-equilibrium state, 121 

non-integrability, 86 

normal conductor—superconductor 
transition, 337 

nuclear matter, 187 

nuclear spin 

— in a magnetic field, 322 

number of clusters, 396 


occupation number, 170, 171, 200, 215 

order, 507 

— ferromagnetic, 333 

order parameter, 193, 214, 293, 337, 
390 

order—disorder transition, 337 

Ornstein—Zernike Correlation Function, 
301, 303, 3438, 371 


ortho hydrogen, 232 
osmotic pressure, 261 


para hydrogen, 232 

paraelectric—ferroelectric transition, 
337 

paramagnet, 34 

— classical, 325 

paramagnet—antiferromagnet transi- 
tion, 337 

paramagnet—ferromagnet transition, 
337 

paramagnetism, 280-283, 320 

parameter flow, 345 

partial pressure, 153, 155 

particle number, 494 

particle-number density, 447 

— equation of motion for, 461 

— local, 448 

particle-number operator, 69 

partition function, 51, 52, 59 

— canonical, 52 

— grand, 64 

— in the magnetic field, 276 

partition integral, 526 

— derivation from the partition 
function, 525 

path integral, see functional integration 

Pauli paramagnetism, 284-287 

Pauli’s master equation, 491-494 

percolation, 387-402 
— bond percolation, 389 

— cluster, 388 

= euntind ai percolation, 388 

— correlation length, 390 

— critical exponents, see critical 
exponents 

— order parameter, 390 

— percolation threshold, 387, 388, 391 

— percolation transition, 390 

— Potts model, 391 

— radial correlation function, 392 

— site percolation, 389 

percolation behavior 

— critical, 398 

percolation threshold, 387, 388, 391, 
401 

percolation transition, 390 

perfect differential, see exact differen- 
tial, 86 

perpetual motion machine 

— of the first kind, 107 

— of the second kind, 2, 108 


perturbation expansion, 536-537 

perturbations 

— irrelevant, 356 

— relevant, 356 

phase boundary curve, 334 

phase space, 9 

phase transition, 132, 140, 331, 332, 
336, 337 

— antiferromagnetic, 336 

— continuous, 332 

— correlation length, 339 

— critical exponents, see critical 
exponents 

— critical point, 332, 334 

— Ehrenfest’s classification, 332 

— ferromagnetic, 333 

— first order, 377 

— helical phases, 336 

— liquid-gas transition, 333 

— n-th order, 332 

— of a molecular zipper, 406 

— of first order, 332, 373, 405 

— of second order, 332 

— order parameter of, 337 

— phase boundary curve, 334 

— power laws, 334 

— separation, 336 

— surface of equation of state, 334 

— tricritical, 404 

— vaporization, 333 

— vaporization curve, 333 

phase-boundary curves, 130-139 

— slope of, 135 

phases, 130, 331 

— coexistence, 130 

— in equilibrium, 130 

phonon dispersion relation, 210 

phonons, 206-221, 223 

— acoustic, 208, 467 

— damping, 434 

— optical, 208 

photon gas, 197-205 

— chemical potential of, 205 

Planck’s quantum of action, 445 

Planck’s radiation law, see radiation 
law 

Poincaré 

— recurrence time, see recurrence time 

— recurrence-time theorem, 481 

Poisson brackets, 12, 523 

polyethylene, 317 

polymers, 317-320, 329, 387 

polystyrene, 317 
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Pomeranchuk effect, 137, 164, 324 

potential 

— Buckingham-, see Buckingham 
potential 

— exp-6-, see exp-6-potential 

— grand, 65, 78, 276 

— grand canonical, 146 

— hard-core, see hard-core potential 

— Lennard-Jones, see Lennard—Jones 
potential 

potentials, thermodynamic, see 
thermodynamic potentials 

Potts model, 391 

pressure, 44, 45, 56, 64, 557 

pressure tensor, 450, 465, 477 

probability, 4 

— conditional, 6, 421 

probability density, 4 

— characteristic function, 5 

— moments of, 4, 5 

probability density of F(X), 5 

process 

— adiabatic, 93, 99, 107 

— cyclic, see cyclic process 

— irreversible, 94-97, 104, 479 

— isentropic, 93 

— isobaric, 93 

— isochoral, 93 

— quasistatic, 93, 109, 110 

— real and reversible, 100 

— reversible, 94, 98-100 

— thermally isolated, 93 

pure system, 130 


quantum corrections, 176, 241, 532-536 
quantum liquid, 187 

quasi-equilibrium states, 502, 504 
quasi-ergodic theorem, 497 
quasiparticles, 189, 190, 213, 215, 217 
quasistatic process, see process 


radial correlation function, 392 

radiation law 

— Planck, 201 

— Rayleigh—Jeans, 202 

— Wien, 202 

radiation pressure, 200, 508 

radius of gyration, 318 

random motion, see random walk 

random variable, 4-6 

random walk, 7, 21 

Raoult’s law, 266 

Rayleigh—Jeans radiation law, see 
radiation law 
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reaction coordinate, 422 

reaction equations, 150 

reaction equilibrium, 154 

reaction rates, 422, 425 

real gas, 236-257 

— see also van der Waals theory 

— caloric equation of state for, 245 

— free energy of, 244 

— thermal equation of state for, 244, 
245 

real-space renormalization procedures, 
see real-space transformations 

real-space RG transformations, general, 
346, 359 

recurrence time, 479, 481-484, 488, 510 

refrigerator, 126 

relaxation-time approximation, 
468-469, 477 

— for the electrical conductivity, 478 

relevant fields, 355 

relevant perturbations, 356 

renormalization group, 345-387 

renormalization group theory, 345-387 

— e€-expansion, 382 

— block transformation, 360 

— configuration-space RG transforma- 
tions, 346 

— coupling constants, 345, 346, 348, 
351 

— critical dimension, 373, see limiting 
dimension, 383 

— critical exponents, 354 

— critical point, 351 

— critical trajectory, 353 

— crossover phenomena, 386 

— cutoff length scale, 345 

— decimation transformation, 346-354 

— effective exponents, 386 

— fields 

—— irrelevant, 355 

—— relevant, 355 

— fixed point, 345, 347 

— flow diagram, 351 

— flux lines, 351 

— high-temperature fixed point, 352 

— Ising model 

—— one-dimensional, 346, 545 

—-— two-dimensional, 349 

— Landau-Ginzburg—Wilson functional, 
380 

— logarithmic corrections, 385 

— low-temperature fixed point, 352 


— momentum-shell renormalization 
group, 380 

— real-space RG transformations, 
345-354, 359 

— renormalization group transforma- 
tion, 345 

— renormalization group transforma- 
tions, 346 

— renormalization transformations, 400 

— RG flow, 381 

— scaling fields, 357 

— two-point function, 383 

— universality properties, 358 

— Wick’s theorem, 383 

— Wilson’s RG scheme, 380 

residual entropy at absolute zero, 
516-521 

reversible changes, see process 

reversible process, see process 

RG flow, 381 

Riemann ¢-function, see ¢-function 

RKKY- interaction, 288 

root mean square deviation, 5 

rotational degrees of freedom, 117, 
227 

rotational energy, 227 

rotational invariance, 338, 339 

rotons, 214 

rubber-like elasticity, 317-320 


Sackur—Tetrode equation, 46 

saturation curve, 132 

saturation magnetization, 316 

saturation region, 132 

scalar product, 457, 476, 553 

scale invariance, 305 

scale transformations, 341 

scaling fields, 357 

scaling functions, 340 

scaling hypothesis, 343 

— for the correlation function, 343 

— static, 339 

scaling laws, 341, 344 

— hyperscaling relation, 344 

scaling relations, 343 

scaling theory, percolation, 398-399 

scattering 

— and the collision term of the 
Boltzmann equation, 469-475 

— inelastic, 467 

— of two hard spheres, 472, 475 

scattering cross-section, 441, 469-475, 
478 


— differential, 471 

— elastic, 304 

scattering phase 

— and second virial coefficient, 532 

Schottky anomaly, 283 

Schrodinger equation, 15 

— relation to the Smoluchowski 
equation, 430 

second viscosity, see bulk viscosity 

self-avoiding random walk, 387 

self-organized criticality, 387 

separation of time scales, 426 

separation transition, 336 

shape dependence, 309 

shear viscosity, 465, 477 

Sievert’s law, 263 

single-particle distribution function, 
438 

site percolation, 389 

Smoluchowski equation, 418-419 

— and supersymmetric quantum 
mechanics, 429-432 

— relation to the Schrédinger equation, 
430 

— solution for a harmonic potential, 
421 

solutions, see dilute solutions 

Sommerfeld expansion, 179, 285 

sound damping, 476 

sound propagation 

— in gases, 453-455 

sound velocity, adiabatic, 454, 477 

specific heat, 83, 90, 273, 310 

— and fluctuations of internal energy, 
90 

— at absolute zero, 514 

— at constant pressure, 83, 256, 466 

— at constant volume, 83, 253, 463 

— in the one-dimensional Ising model, 
546 

— in the van der Waals theory, 253 

— negative, 508 

— numerical values for various 
materials, 559 

— of a paramagnet, 322 

— of a solid, 186, 211-213, 223 

— of ideal molecular gas, 231 

— rotational contribution to, 229, 233 

— vibrational contribution to, 230 

spectroscopic splitting factor, see 
Landé-g-factor 

spin waves, 328, 429 

spin-orbit coupling, 280 
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spin-spin correlation function, 405 

— see also Ornstein—Zernike correlation 
function 

— in the one-dimensional Ising model, 
546 

spinwaves, 556 

spontaneous magnetization, 287, 293, 
364 

stability, 90, 124 

— mechanical, 124 

— thermal, 124 

staggered magnetization, 336, 337, 548 

standard deviation, 5 

state functions, 85, 86 

state of aggregation, 331 

— see also phase 

state variables, 1 

— extensive, 81, 93 

— intensive, 81, 93 

static scaling hypothesis, 339 

stationarity, 122, 123 

statistical operator, see density matrix 

steam engine, 125 

Stefan—Boltzmann law, 200 

Stirling formula, 21, 31 

stochastic equation of motion, 410 

stochastic force, 409 

stochastic process, 410 

stock-market prices 

— as a stochastic process, 435 

stoichiometric coefficients, 150 

Stosszahlansatz, 440, 442 

sublimation, 138 

sublimation curve, 130, 138 

Sun’s temperature, 205 

superconductivity, 336, 364 

supercooling, 379 

superexchange, 289 

superfluidity, 187, 217, 336 

superheating, 379 

supersymmetric quantum mechanics, 
429, 435 

surface of equation of state, 47, 132, 
334 

surface of states, 132 

susceptibility, 84, 300-301 

— adiabatic, 274 

— diamagnetic, 283 

— dielectric, 325 

— dynamic, 550 

— for Pauli paramagnetism, 285 

— in the molecular field approximation, 
302 
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— in the one-dimensional Ising model, 
546 

— isothermal, 273, 293 

— longitudinal, 375 

— molar, 279 

— of harmonic oscillators, 325 

— paramagnetic, 283 

— rel. to applied field, 309 

— rel. to internal field, 309 

— transverse, 372, 373 

symmetry 

— continuous, 373 

— discrete, 338 

symmetry properties, 331 

system 

— interacting, 42 

— isolated, 26 

— multicomponent, 144 

— single-component, 130 


temperature, 38, 39, 45, 64 

— absolute, 91, 143 

— absolute zero of, 513 

— definition of, 39 

— empirical, 91 

— local, 448 

— negative, 320-323 

— scale, 557 

— scale, absolute, 91 

temperature equilibration, 113 

— quasistatic, 109 

temperature fixed point, 143 

thermal efficiency, 127 

thermal pressure coefficient, 84, 88 

thermal wavelength, 67, 95, 204, 437, 
527 

thermalization, 71 

thermodynamic inequalities, 123-124, 
274 

thermodynamic potentials, 75-80, 
144-146, 271-276 

— extremal properties of, 120-122 

— table of, 80 

thermodynamic processes, 92 

thermodynamic quantities 

— derivatives of, 82 

thermodynamic inequalities, 89 

time average, 497 

time reversal 

— and irreversibility, see irreversibility 

— in the Boltzmann equation, 446-447 

time-reversal invariance, 479 

time-reversal transformation, 480, 501 


time-scale separation, 461, 490 

tin (allotropy of), 520 

total (exact) differential, 84 

trajectory, critical, 353 

transfer matrix method, 349, 518, 
545-547 

transition 

— thermally activated, 422 

transition probability 

— in the Boltzmann equation, 440, 469, 
475 

transition rate, see reaction rate, 425 

translational degree of freedom, 225 

translational energy, 225 

transverse susceptibility, see suscepti- 
bility 

tricritical point, 405 

triple line, 132 

triple point, 132, 141-144 

— center-of-gravity rule for, 143 

triple-point cell, 144 

two-fluid model, 217-221 

two-level systems, 34, 48, 320-322, 326 

two-phase region 

— in the van der Waals theory, 250 

two-point function, 383 


uniaxial ferromagnet, 338 

unit cell, 208 

universality, 299, 338-339, 358 
universality classes, 338 

urn model, 511 

— and the H theorem, 512 

— and the Fokker—Planck equation, 512 
— and the Langevin equation, 512 

— and the paramagnet, 511 


van der Waals 

— coefficients, 246 

—— numerical values for some materials, 
560 

— equation of state, 245, 256 

—— dimensionless, 251 

— isotherm, 246, 248 

— §-shaped isotherm, 246 

— theory, 242-257 

van Hove singularities, 210, 211 

Van Vleck paramagnetism, 284 

van’t Hoff formula, 261, 267 

vapor, 132 

vapor pressure, 130, 138, 561 

vapor pressure curve, 130-132 

— see also evaporation curve 

— in the van der Waals theory, 253, 256 


vapor pressure increase 

— by surface tension, 159-160 

vapor pressure increase by other gases, 
156-159 

vapor pressure reduction, 263, 266 

vaporization curve, 333 

vaporization transition, 333, 337 

variables of state 

— see also state functions 

variance, 5 

— see also mean square deviation 

velocity 

— equation of motion for, 461 

— local, 448 

vertex model, 518 

vibrational degree of freedom, 230 

vibrational energy, 227 

virial, 55 

virial coefficient, 237 

— classical approximation for, 238-241 

— for Lennard—Jones potential, 240 

— quantum corrections to, 241, 531-536 

virial expansion, 236-242 

virial theorem 

— classical, 54 

— quantum statistical, 57 
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virtual process, 121 
viscosity, 477 

— see also shear viscosity 
Von Neumann equation, 15 


water, 134 

— see also ice, vapor pressure of 
— anomaly, 136 

Weiss model, 328 

white dwarfs, 188, 222 

Wick’s theorem, 383 

Wien’s displacement law, 201 
Wien’s law, see radiation law 
Wigner function, 523 

Wilson’s RG scheme, 380, 422 
work, 44, 60, 62, 76, 98, 100, 125 
work engine, 126 

work performance, see work 


Yukawa potential, 307 


Zeeman effect, 280 

Zermelo’s paradox, 479, 481, 483 
¢-function, 199 

— and Bernoulli numbers, 537 

— generalized, 173 
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